TRANSMEMBRANE SERINE PROTEASE OVEREXPRESSED IN
OVARIAN CARCINOMA AND USES THEREOF

BACKGROUND OF THE INVENTION 

Cross-Reference to Related Application

This application is a continuation-in-part patent 
application and claims the benefit of priority under 35 U.S.C. § 120
of USSN 09/261,416, filed March 3, 1999. 

Field of the Invention

The present invention relates generally to the fields of 
cellular biology and diagnosis of neoplastic disease. More 
specifically, the present invention relates to a transmembrane 
serine protease termed Tumor Associated Differentially-Expressed
Gene-12 (TADG-12), which is overexpressed in ovarian carcinoma.

Description of the Related Art 

Tumor cells rely on the expression of a concert of 
proteases to be released from their primary sites and move to 
distant sites to inflict lethality. This metastatic nature is the result
of an aberrant expression pattern of proteases by tumor cells and 
also by stromal cells surrounding the tumors [1-3]. For most 
tumors to become metastatic, they must degrade their 
surrounding extracellular matrix components, degrade basement
membranes to gain access to the bloodstream or lymph system,
and repeat this process in reverse fashion to settle in a secondary 
host site [3-6]. All of these processes rely upon what now appears 
to be a synchronized protease cascade. In addition, tumor cells 
use the power of proteases to activate growth and angiogenic
factors that allow the tumor to grow progressively [1]. Therefore, 
much research has been aimed at the identification of tumor- 
associated proteases and the inhibition of these enzymes for 
therapeutic means. More importantly, the secreted nature and/or 
high level expression of many of these proteases allows for their
detection at aberrant levels in patient serum, e.g. the prostate- 
specific antigen (PSA), which allows for early diagnosis of prostate 
cancer [7]. 

Proteases have been associated directly with tumor 
growth, shedding of tumor cells and invasion of target organs.
Individual classes of proteases are involved in, but not limited to,
(1) the digestion of stroma surrounding the initial tumor area; (2)
the digestion of the cellular adhesion molecules to allow
dissociation of tumor cells; and (3) the invasion of the basement
membrane for metastatic growth and the activation of both tumor
growth factors and angiogenic factors.

For many forms of cancer, diagnosis and treatment have
improved dramatically in the last 10 years. However, the five
year survival rate for ovarian cancer remains below 50% due in
large part to the vague symptoms which allow for progression of
the disease to an advanced stage prior to diagnosis [8]. Although 
the exploitation of the CA125 antigen has been useful as a marker 
for monitoring recurrence of ovarian cancer, it has not proven to 
be an ideal marker for early diagnosis. Therefore, new markers 
that may be secreted or released from cells and which are highly
expressed by ovarian tumors could provide a useful tool for the 
early diagnosis and for therapeutic intervention in patients with 
ovarian carcinoma. 
The prior art is deficient in the lack of the complete
identification of the proteases overexpressed in carcinoma and is
therefore deficient in the lack of a tumor marker useful as an
indicator of early disease, particularly for ovarian cancers.
Specifically, TADG-12, a transmembrane serine protease, has not
been previously identified in either nucleic acid or protein form.
The present invention fulfills this long-standing need and desire 
in the art. 



SUMMARY OF THE INVENTION 


The present invention discloses TADG-12, a new 
member of the Tumor Associated Differentially-Expressed Gene 
(TADG) family, and a variant splicing form of TADG-12 (TADG- 
12V) that could lead to a truncated protein product. TADG-12 is a
transmembrane serine protease overexpressed in ovarian
carcinoma. The entire cDNA of TADG-12 has been identified (SEQ 
ID No. 1). This sequence encodes a putative protein of 454 amino 
acids (SEQ ID No. 2) which includes a potential transmembrane 
domain, an LDL receptor like domain, a scavenger receptor
cysteine rich domain, and a serine protease domain. These
features imply that TADG-12 is expressed at the cell surface, and 
it may be used as a molecular target for therapy or a diagnostic 
marker. 



In one embodiment of the present invention, there is
provided a DNA fragment encoding a TADG-12 protein selected 
from the group consisting of: (a) an isolated DNA fragment which 
encodes a TADG-12 protein; (b) an isolated DNA fragment which 
hybridizes to the isolated DNA fragment of (a) above and which
encodes a TADG-12 protein; and (c) an isolated DNA fragment 
differing from the isolated DNA fragments of (a) and (b) above in 
codon sequence due to the degeneracy of the genetic code, and 
which encodes a TADG-12 protein. Specifically, the DNA fragment 
has a sequence shown in SEQ ID No. 1 or SEQ ID No. 3. 

In another embodiment of the present invention, there 
is provided a vector/host cell capable of expressing the DNA of the 
present invention. 

In yet another embodiment of the present invention, 
there is provided an isolated and purified TADG-12 protein 
encoded by DNA selected from the group consisting of: (a) isolated 
DNA which encodes a TADG-12 protein; (b) isolated DNA which 
hybridizes to isolated DNA of (a) above and which encodes a 
TADG-12 protein; and (c) isolated DNA differing from the isolated 
DNAs of (a) and (b) above in codon sequence due to the 
degeneracy of the genetic code, and which encodes a TADG-12 
protein. Specifically, the TADG-12 protein has an amino acid 
sequence shown in SEQ ID No. 2 or SEQ ID No. 4. 

In still yet another embodiment of the present
invention, there is provided a method for detecting expression of a
TADG-12 protein in a cell, comprising the steps of: (a) contacting
mRNA obtained from the cell with a labeled hybridization probe; and
(b) detecting hybridization of the probe with the mRNA.
The present invention further provides methods for
diagnosing a cancer or other malignant hyperplasia by detecting 
the TADG-12 protein or mRNA disclosed herein. 

In still another embodiment of the present invention, 
there is provided a method of inhibiting expression of endogenous
TADG-12 mRNA in a cell by introducing a vector into the cell, 
wherein the vector comprises a DNA fragment of TADG-12 in 
opposite orientation operably linked to elements necessary for 
expression. 

In still yet another embodiment of the present
invention, there is provided a method of inhibiting expression of a
TADG-12 protein in a cell by introducing an antibody directed 
against a TADG-12 protein or fragment thereof. 

In still yet another embodiment of the present
invention, there is provided a method of targeted therapy by
administering a compound having a targeting moiety specific for a 
TADG-12 protein and a therapeutic moiety. Specifically, the 
TADG-12 protein has an amino acid sequence shown in SEQ ID No. 
2 or SEQ ID No. 4. 

The present invention still further provides a method
of vaccinating an individual against TADG-12 by inoculating the
individual with a TADG-12 protein or fragment thereof. 
Specifically, the TADG-12 protein has an amino acid sequence 
shown in SEQ ID No. 2 or SEQ ID No. 4. The TADG-12 fragment
includes the truncated form of TADG-12V peptide having a
sequence shown in SEQ ID No. 8, and a 9-residue up to 12-residue 
fragment of TADG-12 protein. 

In yet another embodiment of the present invention, 
there is provided an immunogenic composition, comprising an 
immunogenic fragment of a TADG-12 protein and an appropriate
adjuvant. The TADG-12 fragment includes the truncated form of 
TADG-12V peptide having a sequence shown in SEQ ID No. 8, and a 
9-residue up to 12-residue fragment of TADG-12 protein. 

Other and further aspects, features, and advantages of 
the present invention will be apparent from the following 
description of the presently preferred embodiments of the 
invention given for the purpose of disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

So that the manner in which the above-recited features,
advantages and objects of the invention, as well as others which 
will become clear, are attained and can be understood in detail, 
more particular descriptions of the invention briefly summarized 
above may be had by reference to certain embodiments thereof 
which are illustrated in the appended drawings. These drawings 
form a part of the specification. It is to be noted, however, that 
the appended drawings illustrate preferred embodiments of the 
invention and therefore are not to be considered limiting in their 
scope. 

Figure 1A shows that the expected PCR product of
approximately 180 bp and the unexpected PCR product of
approximately 300 bp using the redundant serine protease
primers were not amplified from normal ovary cDNA (Lane 1) but
were found in abundance from ovarian tumor cDNA (Lane 2). The
primer sequences for the PCR reactions are indicated by horizontal
arrows. Figure 1B shows that TADG-12 was subcloned from the
180 bp band while the larger 300 bp band was designated
TADG-12V. The sequences were found to overlap for 180 bp (SEQ ID No.
5 for nucleotide sequence, SEQ ID No. 6 for deduced amino acid 
sequence) with the 300 bp TADG-12V (SEQ ID No. 7 for nucleotide 
sequence, SEQ ID No. 8 for deduced amino acid sequence) having 
an additional insert of 133 bases. This insertion (vertical arrow) 
leads to a frame shift, which causes the TADG-12V transcript to 
potentially produce a truncated form of TADG-12 with a variant 
amino acid sequence. 
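
The frame shift follows from simple arithmetic: codons are read three bases at a time, so an insert whose length is not a multiple of three displaces every downstream codon. The sketch below is only an illustration of that reasoning; the function name is hypothetical, and the only figure taken from the text is the 133-base insert length.

```python
def shifts_reading_frame(insert_length: int) -> bool:
    """Return True if an insertion of this many bases changes the reading frame."""
    # Codons are triplets, so only insertions whose length is not a
    # multiple of 3 alter the frame of the downstream sequence.
    return insert_length % 3 != 0


insert_length = 133                 # insert size stated for TADG-12V
frame_offset = insert_length % 3    # 133 = 44 * 3 + 1
print(shifts_reading_frame(insert_length), frame_offset)  # prints: True 1
```

An insert of 132 or 135 bases would, by the same check, leave the downstream frame intact.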

Figure 2 shows that Northern blot analysis for TADG- 
12 revealed three transcripts of 2.4, 1.6 and 0.7 kilobases. These 
transcripts were found at significant levels in ovarian tumors and 
cancer cell lines, but the transcripts were found only at low levels 
in normal ovary. 

Figure 3 shows an RNA dot blot (CLONTECH) probed 
for TADG-12. The transcript was detectable (at background 
levels) in all 50 of the human tissues represented with the 
greatest abundance of transcript in the heart. Putamen, amygdala, 
kidney, liver, small intestine, skeletal muscle, and adrenal gland 
were also found to have intermediate levels of TADG-12 
transcript. 

Figure 4 shows the entire cDNA sequence for
TADG-12 (SEQ ID No. 1) with its predicted open reading frame of 454
amino acids (SEQ ID No. 2). Within the nucleotide sequence, the 
Kozak's consensus sequence for the initiation of translation and 
the poly-adenylation signal are underlined. In the protein 
sequence, a potential transmembrane domain is boxed. The LDLR- 
A domain is underlined with a solid line. The SRCR domain is 
underlined with a broken line. The residues of the catalytic triad 
of the serine protease domain are circled, and the beginning of the 
catalytic domain is marked with an arrow designated as a
potential proteolytic cleavage site. The * represents the stop 
codon that terminates translation. 

Figure 5A shows the 35 amino acid LDLR-A domain
of TADG-12 (SEQ ID No. 13) aligned with other LDLR-A motifs
from the serine protease TMPRSS2 (U75329, SEQ ID No. 14), the
complement subunit C8 (P07358, SEQ ID No. 9), two LDLR-A
domains of the glycoprotein GP300 (P98164, SEQ ID Nos. 11-12),
and the serine protease matriptase (AF118224, SEQ ID No. 10).
TADG-12 has its highest similarity with the other serine proteases:
it is 54% similar to TMPRSS2 and 53% similar to
matriptase. The highly conserved cysteine residues are shown in
bold type. Figure 5B shows the SRCR domain of TADG-12 (SEQ ID
No. 17) aligned with other domain family members including the
human macrophage scavenger receptor (P21757, SEQ ID No. 16),
human enterokinase (P98073, SEQ ID No. 19), bovine enterokinase
(P21758, SEQ ID No. 15), and the serine protease TMPRSS2 (SEQ ID
No. 18). Again, TADG-12 shows its highest similarity within this
region to the protease TMPRSS2 at 43%. Figure 5C shows the
protease domain of TADG-12 (SEQ ID No. 23) in alignment with
other human serine proteases including protease M (U62801, SEQ
ID No. 20), trypsinogen 1 (P07477, SEQ ID No. 21), plasma
kallikrein (P03952, SEQ ID No. 22), hepsin (P05981, SEQ ID No. 25),
and TMPRSS2 (SEQ ID No. 24). Cons represents the consensus
sequence for each alignment.

Figure 6 shows semi-quantitative PCR analysis that
was performed for TADG-12 (upper panel) and TADG-12V (lower
panel). The amplification of TADG-12 or TADG-12V was
performed in parallel with PCR amplification of β-tubulin product
as an internal control. The TADG-12 transcript was found to be
overexpressed in 41 of 55 carcinomas. The TADG-12V transcript
was found to be overexpressed in 8 of 22 carcinomas examined.
Note that the samples in the upper panel are not necessarily the
same as the samples in the lower panel.

Figure 7 shows immunohistochemical staining of
normal ovary and ovarian tumors, which was performed using a
polyclonal rabbit antibody developed to a TADG-12 specific
peptide. No significant staining was detected in normal ovary
(Figure 7A). Strong positive staining was observed in 22 of 29
carcinomas examined. Figures 7B and 7C represent a serous and
mucinous carcinoma, respectively. Both show diffuse staining
throughout the cytoplasm of tumor cells while stromal cells
remain relatively unstained.

Figure 8 is a model to demonstrate the progression of
TADG-12 within a cellular context. In normal circumstances, the
TADG-12 transcript is appropriately spliced and the resulting
protein is capable of being expressed at the cell surface where the
protease may be cleaved to an active form. The role of the
remaining ligand binding domains has not yet been determined,
but one can envision their potential to bind other molecules for
activation, internalization or both. The TADG-12V transcript,
which occurs in some tumors and may be the result of mutation
and/or poor mRNA processing, may be capable of producing a
truncated form of TADG-12 that does not have a functional
protease domain. In addition, this truncated product may present
a novel epitope at the surface of tumor cells.



WO 00/52044 PCT/US00/05612

DETAILED DESCRIPTION OF THE INVENTION 



To examine the serine proteases expressed by ovarian 
cancers, a PCR-based differential display technique was employed
utilizing redundant PCR primers designed to the most highly 
conserved amino acids in these proteins [9]. As a result, a novel 
cell-surface, multi-domain serine protease, named Tumor 
Associated Differentially-expressed Gene-12 (TADG-12) was 
identified. TADG-12 appears to be overexpressed in many ovarian 
tumors. The extracellular nature of TADG-12 may render tumors 
susceptible to detection via a TADG-12 specific assay. In addition, 
a splicing variant of TADG-12, named TADG-12V, was detected at 
elevated levels in 35% of the tumors that were examined. TADG- 
12V encodes a truncated form of TADG-12 with an altered amino 
acid sequence that may be a unique tumor specific target for 
future therapeutic approaches. 

The TADG-12 cDNA is 2413 base pairs long (SEQ ID No. 
1) encoding a 454 amino acid protein (SEQ ID No. 2). A variant 
form, TADG-12V (SEQ ID No. 3), encodes a 294 amino acid protein 
(SEQ ID No. 4). The availability of the TADG-12 and/or TADG-12V 
gene opens the way for a number of studies that can lead to various
applications. For example, the TADG-12 and/or TADG-12V gene 
can be used as a diagnostic or therapeutic target in ovarian 
carcinoma and other carcinomas including breast, prostate, lung 
and colon. 

In accordance with the present invention there may be 
employed conventional molecular biology, microbiology, and 
recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Maniatis, 
Fritsch & Sambrook, "Molecular Cloning: A Laboratory Manual"
(1982); "DNA Cloning: A Practical Approach," Volumes I and II
(D.N. Glover ed. 1985); "Oligonucleotide Synthesis" (M.J. Gait ed.
1984); "Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds.
(1985)]; "Transcription and Translation" [B.D. Hames & S.J. Higgins
eds. (1984)]; "Animal Cell Culture" [R.I. Freshney, ed. (1986)];
"Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A
Practical Guide To Molecular Cloning" (1984).

Therefore, if appearing herein, the following terms 
shall have the definitions set out below. 

As used herein, the term "cDNA" shall refer to the DNA 
copy of the mRNA transcript of a gene. 

As used herein, the term "derived amino acid 
sequence" shall mean the amino acid sequence determined by
reading the triplet sequence of nucleotide bases in the cDNA. 
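
As a minimal sketch of what "reading the triplet sequence" means in practice, the following example translates a cDNA string codon by codon using the standard genetic code. The function name and the short input sequences are hypothetical and are not taken from SEQ ID No. 1; the example assumes the reading frame starts at the first base of the input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def derive_amino_acids(cdna: str) -> str:
    """Translate a cDNA coding strand (5' to 3') until the first stop codon."""
    cdna = cdna.upper()
    protein = []
    for i in range(0, len(cdna) - 2, 3):   # step through complete triplets only
        residue = CODON_TABLE[cdna[i:i + 3]]
        if residue == "*":                 # stop codon terminates translation
            break
        protein.append(residue)
    return "".join(protein)


print(derive_amino_acids("ATGGCCTGA"))  # prints: MA
```

Shifting the starting position by one or two bases would yield an entirely different derived sequence, which is why the reading frame matters.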

As used herein, the term "screening a library" shall
refer to the process of using a labeled probe to check whether, 
under the appropriate conditions, there is a sequence 
complementary to the probe present in a particular DNA library. 
In addition, "screening a library" could be performed by PCR. 

As used herein, the term "PCR" refers to the 
polymerase chain reaction that is the subject of U.S. Patent Nos. 
4,683,195 and 4,683,202 to Mullis, as well as other improvements 
now known in the art. 

The amino acids described herein are preferred to be in
the "L" isomeric form. However, residues in the "D" isomeric form 
can be substituted for any L-amino acid residue, as long as the 
desired functional property of immunoglobulin-binding is retained 
by the polypeptide. NH2 refers to the free amino group present at 
the amino terminus of a polypeptide. COOH refers to the free
carboxy group present at the carboxy terminus of a polypeptide.
In keeping with standard polypeptide nomenclature, J. Biol. Chem.,
243:3552-59 (1969), abbreviations for amino acid residues are 
known in the art. 

It should be noted that all amino-acid residue 
sequences are represented herein by formulae whose left and 
right orientation is in the conventional direction of amino- 
terminus to carboxy-terminus. Furthermore, it should be noted 
that a dash at the beginning or end of an amino acid residue 
sequence indicates a peptide bond to a further sequence of one or 
more amino-acid residues. 

A "replicon" is any genetic element (e.g., plasmid, 
chromosome, virus) that functions as an autonomous unit of DNA 
replication in vivo; i.e., capable of replication under its own 
control. 

A "vector" is a replicon, such as plasmid, phage or 
cosmid, to which another DNA segment may be attached so as to 
bring about the replication of the attached segment. 

A "DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in 
either its single-stranded form or a double-stranded helix. This
term refers only to the primary and secondary structure of the 
molecule, and does not limit it to any particular tertiary forms. 
Thus, this term includes double-stranded DNA found, inter alia, in 
linear DNA molecules (e.g., restriction fragments), viruses, 
plasmids, and chromosomes. The structure is discussed herein
according to the normal convention of giving only the sequence in
the 5' to 3' direction along the nontranscribed strand of DNA (i.e.,
the strand having a sequence homologous to the mRNA).
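
The strand convention can be illustrated with a short sketch: given the nontranscribed strand written 5' to 3', the mRNA has the same sequence with U in place of T, while the transcribed (template) strand is its reverse complement. The function names and the six-base example are hypothetical and serve only to show the relationship.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def template_strand(nontranscribed: str) -> str:
    """Return the transcribed (template) strand, also written 5' to 3'."""
    # Complement each base, then reverse so the result reads 5' to 3'.
    return nontranscribed.upper().translate(COMPLEMENT)[::-1]


def mrna_from_nontranscribed(nontranscribed: str) -> str:
    """Return the mRNA, which matches the nontranscribed strand with U for T."""
    return nontranscribed.upper().replace("T", "U")


strand = "ATGGCC"  # hypothetical nontranscribed strand, 5' to 3'
print(template_strand(strand))           # prints: GGCCAT
print(mrna_from_nontranscribed(strand))  # prints: AUGGCC
```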

An "origin of replication" refers to those DNA
sequences that participate in DNA synthesis.

A DNA "coding sequence" is a double-stranded DNA
sequence which is transcribed and translated into a polypeptide in
vivo when placed under the control of appropriate regulatory
sequences. The boundaries of the coding sequence are determined
by a start codon at the 5' (amino) terminus and a translation stop
codon at the 3' (carboxyl) terminus. A coding sequence can
include, but is not limited to, prokaryotic sequences, cDNA from
eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g.,
mammalian) DNA, and even synthetic DNA sequences. A
polyadenylation signal and transcription termination sequence
will usually be located 3' to the coding sequence.
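
To make the boundary definition concrete, the sketch below locates the first ATG and the first in-frame stop codon that follows it. It is a simplified illustration, assuming ATG as the only start codon and ignoring context such as the Kozak sequence; the function name and the test sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_coding_sequence(dna: str):
    """Return (start, end) indices of the first ATG-to-stop reading frame, or None.

    `end` is exclusive and includes the stop codon, so dna[start:end]
    spans the start codon through the stop codon.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Walk codon by codon in the frame set by this start codon.
        for i in range(start + 3, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                return start, i + 3
        start = dna.find("ATG", start + 1)  # no in-frame stop; try the next ATG
    return None


sequence = "CCATGGCCTGATT"  # hypothetical: 5' flank, ATG GCC TGA, 3' flank
print(find_coding_sequence(sequence))  # prints: (2, 11)
```

Under this definition, an open reading frame of 454 codons followed by one stop codon would span 454 × 3 + 3 = 1365 nucleotides.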

Transcriptional and translational control sequences are 
DNA regulatory sequences, such as promoters, enhancers, 
polyadenylation signals, terminators, and the like, that provide for 
the expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region
capable of binding RNA polymerase in a cell and initiating
transcription of a downstream (3' direction) coding sequence. For
purposes of defining the present invention, the promoter sequence
is bounded at its 3' terminus by the transcription initiation site
and extends upstream (5' direction) to include the minimum
number of bases or elements necessary to initiate transcription at
levels detectable above background. Within the promoter
sequence will be found a transcription initiation site, as well as
protein binding domains (consensus sequences) responsible for
the binding of RNA polymerase. Eukaryotic promoters often, but
not always, contain "TATA" boxes and "CAT" boxes. Prokaryotic
promoters contain Shine-Dalgarno sequences in addition to the -10
and -35 consensus sequences.
An "expression control sequence" is a DNA sequence
that controls and regulates the transcription and translation of
another DNA sequence. A coding sequence is "under the control"
of transcriptional and translational control sequences in a cell
when RNA polymerase transcribes the coding sequence into
mRNA, which is then translated into the protein encoded by the
coding sequence.

A "signal sequence" can be included near the coding 
sequence. This sequence encodes a signal peptide, N-terminal to 
the polypeptide, that communicates to the host cell to direct the
polypeptide to the cell surface or secrete the polypeptide into the
media, and this signal peptide is clipped off by the host cell before 
the protein leaves the cell. Signal sequences can be found 
associated with a variety of proteins native to prokaryotes and 
eukaryotes. 

The term "oligonucleotide", as used herein in referring
to the probe of the present invention, is defined as a molecule
comprised of two or more ribonucleotides, preferably more than 
three. Its exact size will depend upon many factors which, in turn, 
depend upon the ultimate function and use of the oligonucleotide. 

The term "primer" as used herein refers to an
oligonucleotide, whether occurring naturally as in a purified
restriction digest or produced synthetically, which is capable of
acting as a point of initiation of synthesis when placed under
conditions in which synthesis of a primer extension product, which
is complementary to a nucleic acid strand, is induced, i.e., in the
presence of nucleotides and an inducing agent such as a DNA 
polymerase and at a suitable temperature and pH. The primer 
may be either single-stranded or double-stranded and must be 
sufficiently long to prime the synthesis of the desired extension 
product in the presence of the inducing agent. The exact length of 
the primer will depend upon many factors, including temperature, 
source of primer and use of the method. For example, for diagnostic
applications, depending on the complexity of the target sequence, 
the oligonucleotide primer typically contains 15-25 or more 
nucleotides, although it may contain fewer nucleotides. 

The primers herein are selected to be "substantially" 
complementary to different strands of a particular target DNA 
sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. 
Therefore, the primer sequence need not reflect the exact 
sequence of the template. For example, a non-complementary 
nucleotide fragment may be attached to the 5' end of the primer, 
with the remainder of the primer sequence being complementary 
to the strand. Alternatively, non-complementary bases or longer 
sequences can be interspersed into the primer, provided that the 
primer sequence has sufficient complementarity with the sequence
to hybridize therewith and thereby form the template for the
synthesis of the extension product. 
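
The idea that a primer need only be "substantially" complementary can be sketched as follows: a primer carrying a non-complementary 5' tail can still prime synthesis if its 3' portion is complementary to the template. This is a deliberately crude model; the `min_len` cutoff, the function names and the sequences are hypothetical, and real annealing also depends on temperature, salt and mismatch position.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def three_prime_anneals(primer: str, template: str, min_len: int) -> bool:
    """True if the last `min_len` bases of the primer are complementary
    to some stretch of the template (both written 5' to 3')."""
    anneal_region = primer.upper()[-min_len:]
    return reverse_complement(anneal_region) in template.upper()


template = "GGATCCATGGCCTTAGC"   # hypothetical template strand
primer = "AAAA" + "GCTAAGG"      # 4-base non-complementary 5' tail + 7-base match

print(three_prime_anneals(primer, template, 7))   # prints: True
print(three_prime_anneals(primer, template, 11))  # prints: False
```

The second call fails because it demands that the 5' tail also match, which the definition above does not require.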

As used herein, the terms "restriction endonucleases" 
and "restriction enzymes" refer to enzymes, each of which cuts
double-stranded DNA at or near a specific nucleotide sequence. 
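
As an illustration of sequence-specific cutting, the sketch below digests a sequence at every occurrence of a recognition site. EcoRI, which recognizes GAATTC and cuts after the first G, is used purely as a familiar example and is not named in the text; the function names and the input sequence are hypothetical, and only one strand is modeled.

```python
def cut_positions(dna: str, site: str = "GAATTC", offset: int = 1) -> list:
    """Return the indices at which the strand is cut.

    `offset` is the cut position within the recognition site
    (1 for EcoRI, which cuts between G and AATTC).
    """
    dna = dna.upper()
    positions = []
    i = dna.find(site)
    while i != -1:
        positions.append(i + offset)
        i = dna.find(site, i + 1)
    return positions


def digest(dna: str, site: str = "GAATTC", offset: int = 1) -> list:
    """Split the strand into fragments at every cut position."""
    bounds = [0] + cut_positions(dna, site, offset) + [len(dna)]
    return [dna[a:b] for a, b in zip(bounds, bounds[1:])]


sequence = "AAGAATTCGGGAATTCTT"  # hypothetical sequence with two EcoRI sites
print(cut_positions(sequence))   # prints: [3, 11]
print(digest(sequence))          # prints: ['AAG', 'AATTCGGG', 'AATTCTT']
```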

A cell has been "transformed" by exogenous or 
heterologous DNA when such DNA has been introduced inside the 
cell. The transforming DNA may or may not be integrated
(covalently linked) into the genome of the cell. In prokaryotes, 
yeast, and mammalian cells for example, the transforming DNA 
may be maintained on an episomal element such as a plasmid. 
With respect to eukaryotic cells, a stably transformed cell is one in 
which the transforming DNA has become integrated into a 
chromosome so that it is inherited by daughter cells through 
chromosome replication. This stability is demonstrated by the 
ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cells containing the 
transforming DNA. A "clone" is a population of cells derived from 
a single cell or ancestor by mitosis. A "cell line" is a clone of a 
primary cell that is capable of stable growth in vitro for many 
generations. 

Two DNA sequences are "substantially homologous" 
when at least about 75% (preferably at least about 80%, and most 
preferably at least about 90% or 95%) of the nucleotides match 
over the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the 
sequences using standard software available in sequence data 
banks, or in a Southern hybridization experiment under, for 
example, stringent conditions as defined for that particular 
system. Defining appropriate hybridization conditions is within 
the skill of the art. See, e.g., Maniatis et al., supra; DNA Cloning, 
Vols. I & II, supra; Nucleic Acid Hybridization, supra. 
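
The percentage criterion can be sketched as a position-by-position comparison over a defined length. This assumes the two sequences are already aligned and of equal length, with no gaps; it is not a substitute for alignment software or a hybridization experiment, and the function names and sequences are hypothetical.

```python
def percent_match(a: str, b: str) -> float:
    """Percentage of positions at which two equal-length sequences agree."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def substantially_homologous(a: str, b: str, threshold: float = 75.0) -> bool:
    """Apply the 'at least about 75%' criterion (threshold is adjustable)."""
    return percent_match(a, b) >= threshold


seq_a = "ATGGCCTTAGCA"  # hypothetical 12-base sequences differing at 2 positions
seq_b = "ATGGCATTAGCT"

print(round(percent_match(seq_a, seq_b), 1))           # prints: 83.3
print(substantially_homologous(seq_a, seq_b))          # prints: True
print(substantially_homologous(seq_a, seq_b, 90.0))    # prints: False
```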

A "heterologous" region of the DNA construct is an 
identifiable segment of DNA within a larger DNA molecule that is 
not found in association with the larger molecule in nature. Thus, 
when the heterologous region encodes a mammalian gene, the
gene will usually be flanked by DNA that does not flank the
mammalian genomic DNA in the genome of the source organism.
Another example of a heterologous coding sequence is a construct where the
coding sequence itself is not found in nature (e.g., a cDNA where 
the genomic coding sequence contains introns, or synthetic 
sequences having codons different than the native gene). Allelic 
variations or naturally-occurring mutational events do not give 
rise to a heterologous region of DNA as defined herein. 

The labels most commonly employed for these studies 
are radioactive elements, enzymes, chemicals which fluoresce 
when exposed to ultraviolet light, and others. A number of 
fluorescent materials are known and can be utilized as labels. 
These include, for example, fluorescein, rhodamine, auramine, 
Texas Red, AMCA blue and Lucifer Yellow. A particular detecting 
material is anti-rabbit antibody prepared in goats and conjugated 
with fluorescein through an isothiocyanate. 

Proteins can also be labeled with a radioactive element 
or with an enzyme. The radioactive label can be detected by any 
of the currently available counting procedures. The preferred 
isotope may be selected from 3H, 14C, 32P, 35S, 36Cl, 51Cr, 57Co, 58Co,
59Fe, 90Y, 125I, 131I, and 186Re.

Enzyme labels are likewise useful, and can be detected 
by any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques. 
The enzyme is conjugated to the selected particle by reaction with 
bridging molecules such as carbodiimides, diisocyanates, 
glutaraldehyde and the like. Many enzymes which can be used in 
these procedures are known and can be utilized. The preferred 
are peroxidase, β-glucuronidase, β-D-glucosidase,
β-D-galactosidase, urease, glucose oxidase plus peroxidase and alkaline
phosphatase. U.S. Patent Nos. 3,654,090, 3,850,752, and 4,016,043 
are referred to by way of example for their disclosure of alternate 
labeling material and methods. 

A particular assay system developed and utilized in 
the art is known as a receptor assay. In a receptor assay, the 
material to be assayed is appropriately labeled and then certain 
cellular test colonies are inoculated with a quantity of both the
labeled and unlabeled material, after which binding studies are conducted to determine the
extent to which the labeled material binds to the cell receptors. In
this way, differences in affinity between materials can be 
ascertained. 

An assay useful in the art is known as a "cis/trans" 
assay. Briefly, this assay employs two genetic constructs, one of 
which is typically a plasmid that continually expresses a particular 
receptor of interest when transfected into an appropriate cell line, 
and the second of which is a plasmid that expresses a reporter 
such as luciferase, under the control of a receptor/ligand complex. 
Thus, for example, if it is desired to evaluate a compound as a 
ligand for a particular receptor, one of the plasmids would be a 
construct that results in expression of the receptor in the chosen 
cell line, while the second plasmid would possess a promoter 
linked to the luciferase gene in which the response element to the 
particular receptor is inserted. If the compound under test is an 
agonist for the receptor, the ligand will complex with the receptor, 
and the resulting complex will bind the response element and 
initiate transcription of the luciferase gene. The resulting 
chemiluminescence is then measured photometrically, and dose 
response curves are obtained and compared to those of known 
ligands. The foregoing protocol is described in detail in U.S. Patent
No. 4,981,784. 

As used herein, the term "host" is meant to include not
only prokaryotes but also eukaryotes such as yeast, plant and
animal cells. A recombinant DNA molecule or gene which encodes
a human TADG-12 protein of the present invention can be used to
transform a host using any of the techniques commonly known to
those of ordinary skill in the art. Especially preferred is the use of
a vector containing coding sequences for the gene which encodes a
human TADG-12 protein of the present invention for purposes of
prokaryote transformation. Prokaryotic hosts may include E. coli,
S. typhimurium, Serratia marcescens and Bacillus subtilis.
Eukaryotic hosts include yeasts such as Pichia pastoris,
mammalian cells and insect cells.

In general, expression vectors containing promoter
sequences which facilitate the efficient transcription of the
inserted DNA fragment are used in connection with the host. The
expression vector typically contains an origin of replication,
promoter(s), terminator(s), as well as specific genes which are
capable of providing phenotypic selection in transformed cells.
The transformed hosts can be fermented and cultured according to
means known in the art to achieve optimal cell growth.

The invention includes a substantially pure DNA
encoding a TADG-12 protein, a strand of which DNA will hybridize
at high stringency to a probe containing a sequence of at least 15
consecutive nucleotides of the sequence shown in SEQ ID No. 1 or
SEQ ID No. 3. The protein encoded by the DNA of this invention
may share at least 80% sequence identity (preferably 85%, more
preferably 90%, and most preferably 95%) with the amino acids
listed in SEQ ID No. 2 or SEQ ID No. 4. More preferably, the DNA
includes the coding sequence of the nucleotides of Figure 4 (SEQ ID
No. 1), or a degenerate variant of such a sequence.

The probe to which the DNA of the invention 
hybridizes preferably consists of a sequence of at least 20
consecutive nucleotides, more preferably 40 nucleotides, even 
more preferably 50 nucleotides, and most preferably 100 
nucleotides or more (up to 100%) of the coding sequence of the 
nucleotides listed in Figure 4 (SEQ ID No. 1) or the complement 
thereof. Such a probe is useful for detecting expression of TADG- 
12 in a human cell by a method including the steps of (a) 
contacting mRNA obtained from the cell with the labeled 
hybridization probe; and (b) detecting hybridization of the probe 
with the mRNA. 

This invention also includes a substantially pure DNA 
containing a sequence of at least 15 consecutive nucleotides 
(preferably 20, more preferably 30, even more preferably 50, and 
most preferably all) of the region from nucleotides 1 to 2413 of 
the nucleotides listed in SEQ ID No. 1, or of the region from 
nucleotides 1 to 2544 of the nucleotides listed in SEQ ID No. 3. The 
present invention also comprises antisense oligonucleotides 
directed against this novel DNA. Given the teachings of the 
present invention, a person having ordinary skill in this art would 
readily be able to develop antisense oligonucleotides directed 
against this DNA. 
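
By way of illustration only, the relationship between a target region and an antisense oligonucleotide directed against it can be sketched as a reverse-complement operation. The 20-nucleotide target below is a hypothetical sequence chosen for the example and is not taken from SEQ ID No. 1 or SEQ ID No. 3; the function names and the default length of 15 are likewise assumptions.

```python
# Illustrative sketch only; the target sequence is hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def antisense_oligo(target, start, length=15):
    """Return an oligonucleotide complementary to target[start:start+length]."""
    window = target[start:start + length]
    if len(window) < length:
        raise ValueError("window runs past the end of the target")
    return reverse_complement(window)


target = "ATGGCCTTGAACGTCAGGTA"  # hypothetical sense-strand fragment
oligo = antisense_oligo(target, start=0, length=15)
print(oligo)
```

The oligonucleotide returned is written 5' to 3' and would pair with the first 15 nucleotides of the hypothetical target.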

By "high stringency" is meant DNA hybridization and 
wash conditions characterized by high temperature and low salt 
concentration, e.g., wash conditions of 65°C at a salt concentration 
of approximately 0.1 x SSC, or the functional equivalent thereof.
For example, high stringency conditions may include hybridization
at about 42°C in the presence of about 50% formamide; a first 
wash at about 65°C with about 2 x SSC containing 1% SDS; followed 
by a second wash at about 65°C with about 0.1 x SSC. 
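
As a non-limiting illustration of the salt concentrations involved, the sketch below converts an SSC strength into molar concentrations. It assumes the conventional 20 x SSC stock of 3.0 M sodium chloride and 0.3 M sodium citrate; that stock composition is a general laboratory convention and is not recited in this specification.

```python
# Assumption: 20 x SSC = 3.0 M NaCl + 0.3 M sodium citrate (laboratory convention).
STOCK_20X = {"NaCl_M": 3.0, "sodium_citrate_M": 0.3}


def ssc_molarity(strength):
    """Return molar concentrations for an SSC solution of the given strength."""
    factor = strength / 20.0
    return {name: conc * factor for name, conc in STOCK_20X.items()}


first_wash = ssc_molarity(2.0)    # about 2 x SSC
second_wash = ssc_molarity(0.1)   # about 0.1 x SSC
print(first_wash, second_wash)
```

Under that assumption the second wash is twenty-fold lower in salt than the first, which is what makes it the more stringent of the two.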

By "substantially pure DNA" is meant DNA that is not 
part of a milieu in which the DNA naturally occurs, by virtue of 
separation (partial or total purification) of some or all of the 
molecules of that milieu, or by virtue of alteration of sequences 
that flank the claimed DNA. The term therefore includes, for 
example, a recombinant DNA which is incorporated into a vector, 
into an autonomously replicating plasmid or virus, or into the 
genomic DNA of a prokaryote or eukaryote; or which exists as a 
separate molecule (e.g., a cDNA or a genomic or cDNA fragment 
produced by polymerase chain reaction (PCR) or restriction
endonuclease digestion) independent of other sequences. It also 
includes a recombinant DNA which is part of a hybrid gene 
encoding additional polypeptide sequence, e.g., a fusion protein. 
Also included is a recombinant DNA which includes a portion of 
the nucleotides shown in SEQ ID No. 3 which encodes an 
alternative splice variant of TADG-12 (TADG-12V). 

The DNA may have at least about 70% sequence 
identity to the coding sequence of the nucleotides listed in SEQ ID 
No. 1 or SEQ ID No. 3, preferably at least 75% (e.g. at least 80%); 
and most preferably at least 90%. The identity between two 
sequences is a direct function of the number of matching or 
identical positions. When a subunit position in both of the two 
sequences is occupied by the same monomeric subunit, e.g., if a 
given position is occupied by an adenine in each of two DNA 
molecules, then they are identical at that position. For example, if
7 positions in a sequence 10 nucleotides in length are identical to
the corresponding positions in a second 10-nucleotide sequence,
then the two sequences have 70% sequence identity. The length
of comparison sequences will generally be at least 50 nucleotides,
preferably at least 60 nucleotides, more preferably at least 75
nucleotides, and most preferably 100 nucleotides. Sequence
identity is typically measured using sequence analysis software
(e.g., Sequence Analysis Software Package of the Genetics
Computer Group, University of Wisconsin Biotechnology Center,
1710 University Avenue, Madison, WI 53705).
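
The position-by-position definition of identity given above can be sketched as follows. This is a minimal illustration that assumes the two sequences are already aligned and of equal length; it is not the algorithm of the software package named above, and the two 10-nucleotide sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions occupied by the same nucleotide in both sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned and of equal length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


first = "ACGTACGTAC"    # hypothetical 10-nucleotide sequence
second = "ACGTACGGGT"   # identical to the first at 7 of 10 positions
identity = percent_identity(first, second)
print(identity)
```

The two hypothetical sequences match at seven of ten positions, reproducing the 70% figure of the example in the text.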

The present invention comprises a vector comprising a
DNA sequence which encodes a human TADG-12 protein and the
vector is capable of replication in a host which comprises, in
operable linkage: a) an origin of replication; b) a promoter; and c)
a DNA sequence coding for said protein. Preferably, the vector of
the present invention contains a portion of the DNA sequence
shown in SEQ ID No. 1 or SEQ ID No. 3. A "vector" may be defined
as a replicable nucleic acid construct, e.g., a plasmid or viral
nucleic acid. Vectors may be used to amplify and/or express
nucleic acid encoding a TADG-12 protein. An expression vector is
a replicable construct in which a nucleic acid sequence encoding a
polypeptide is operably linked to suitable control sequences
capable of effecting expression of the polypeptide in a cell. The
need for such control sequences will vary depending upon the cell
selected and the transformation method chosen. Generally, control
sequences include a transcriptional promoter and/or enhancer,
suitable mRNA ribosomal binding sites, and sequences which
control the termination of transcription and translation. Methods
which are well known to those skilled in the art can be used to
construct expression vectors containing appropriate
transcriptional and translational control signals. See for example,
the techniques described in Sambrook et al., 1989, Molecular
Cloning: A Laboratory Manual (2nd Ed.), Cold Spring Harbor Press,
N.Y. A gene and its transcription control sequences are defined as
being "operably linked" if the transcription control sequences
effectively control the transcription of the gene. Vectors of the
invention include, but are not limited to, plasmid vectors and viral
vectors. Preferred viral vectors of the invention are those derived
from retroviruses, adenovirus, adeno-associated virus, SV40 virus,
or herpes viruses.

By a "substantially pure protein" is meant a protein
which has been separated from at least some of those components
which naturally accompany it. Typically, the protein is
substantially pure when it is at least 60%, by weight, free from the
proteins and other naturally-occurring organic molecules with
which it is naturally associated in vivo. Preferably, the purity of
the preparation is at least 75%, more preferably at least 90%, and
most preferably at least 99%, by weight. A substantially pure
TADG-12 protein may be obtained, for example, by extraction
from a natural source; by expression of a recombinant nucleic acid
encoding a TADG-12 polypeptide; or by chemically synthesizing
the protein. Purity can be measured by any appropriate method,
e.g., column chromatography such as immunoaffinity
chromatography using an antibody specific for TADG-12,
polyacrylamide gel electrophoresis, or HPLC analysis. A protein is
substantially free of naturally associated components when it is
separated from at least some of those contaminants which
accompany it in its natural state. Thus, a protein which is
chemically synthesized or produced in a cellular system different
from the cell from which it naturally originates will be, by
definition, substantially free from its naturally associated
components. Accordingly, substantially pure proteins include
eukaryotic proteins synthesized in E. coli, other prokaryotes, or
any other organism in which they do not naturally occur.

In addition to substantially full-length proteins, the
invention also includes fragments (e.g., antigenic fragments) of the
TADG-12 protein. As used herein, "fragment," as applied to a
polypeptide, will ordinarily be at least 10 residues, more typically
at least 20 residues, and preferably at least 30 (e.g., 50) residues
in length, but less than the entire, intact sequence. Fragments of
the TADG-12 protein can be generated by methods known to those
skilled in the art, e.g., by enzymatic digestion of naturally
occurring or recombinant TADG-12 protein, by recombinant DNA
techniques using an expression vector that encodes a defined
fragment of TADG-12, or by chemical synthesis. The ability of a
candidate fragment to exhibit a characteristic of TADG-12 (e.g.,
binding to an antibody specific for TADG-12) can be assessed by
methods described herein. Purified TADG-12 or antigenic
fragments of TADG-12 can be used to generate new antibodies or
to test existing antibodies (e.g., as positive controls in a diagnostic
assay) by employing standard protocols known to those skilled in
the art. Included in this invention are polyclonal antisera
generated by using TADG-12 or a fragment of TADG-12 as the
immunogen in, e.g., rabbits. Standard protocols for monoclonal
and polyclonal antibody production known to those skilled in this
art are employed. The monoclonal antibodies generated by this
procedure can be screened for the ability to identify recombinant
TADG-12 cDNA clones, and to distinguish them from known cDNA
clones.

Further included in this invention are TADG-12
proteins which are encoded at least in part by portions of SEQ ID
No. 1 or SEQ ID No. 3, e.g., products of alternative mRNA splicing or
alternative protein processing events, or in which a section of
TADG-12 sequence has been deleted. The fragment, or the intact
TADG-12 polypeptide, may be covalently linked to another
polypeptide, e.g. which acts as a label, a ligand or a means to
increase antigenicity.

The invention also includes a polyclonal or monoclonal
antibody which specifically binds to TADG-12. The invention
encompasses not only an intact monoclonal antibody, but also an
immunologically-active antibody fragment, e.g., a Fab or F(ab')2
fragment; an engineered single chain Fv molecule; or a chimeric
molecule, e.g., an antibody which contains the binding specificity
of one antibody, e.g., of murine origin, and the remaining portions
of another antibody, e.g., of human origin.

In one embodiment, the antibody, or a fragment
thereof, may be linked to a toxin or to a detectable label, e.g. a
radioactive label, non-radioactive isotopic label, fluorescent label,
chemiluminescent label, paramagnetic label, enzyme label, or
colorimetric label. Examples of suitable toxins include diphtheria
toxin, Pseudomonas exotoxin A, ricin, and cholera toxin. Examples
of suitable enzyme labels include malate dehydrogenase,
staphylococcal nuclease, delta-5-steroid isomerase, alcohol
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose
phosphate isomerase, peroxidase, alkaline phosphatase,
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease,
urease, catalase, glucose-6-phosphate dehydrogenase,
glucoamylase, acetylcholinesterase, etc. Examples of suitable
radioisotopic labels include ³H, ¹²⁵I, ¹³¹I, ³²P, ³⁵S, ¹⁴C, etc.

Paramagnetic isotopes for purposes of in vivo
diagnosis can also be used according to the methods of this
invention. There are numerous examples of elements that are
useful in magnetic resonance imaging. For discussions on in vivo
nuclear magnetic resonance imaging, see, for example, Schaefer et
al., (1989) JACC 14, 472-480; Shreve et al., (1986) Magn. Reson.
Med. 3, 336-340; Wolf, G. L., (1984) Physiol. Chem. Phys. Med.
NMR 16, 93-95; Wesbey et al., (1984) Physiol. Chem. Phys. Med.
NMR 16, 145-155; Runge et al., (1984) Invest. Radiol. 19, 408-415.
Examples of suitable fluorescent labels include a fluorescein label,
an isothiocyanate label, a rhodamine label, a phycoerythrin label, a
phycocyanin label, an allophycocyanin label, an o-phthaldehyde
label, a fluorescamine label, etc. Examples of chemiluminescent
labels include a luminol label, an isoluminol label, an aromatic
acridinium ester label, an imidazole label, an acridinium salt label,
an oxalate ester label, a luciferin label, a luciferase label, an
aequorin label, etc.

Those of ordinary skill in the art will know of other
suitable labels which may be employed in accordance with the
present invention. The binding of these labels to antibodies or
fragments thereof can be accomplished using standard techniques
commonly known to those of ordinary skill in the art. Typical
techniques are described by Kennedy et al., (1976) Clin. Chim.
Acta 70, 1-31; and Schurs et al., (1977) Clin. Chim. Acta 81, 1-40.
Coupling techniques mentioned in the latter are the
glutaraldehyde method, the periodate method, the dimaleimide
method, and the m-maleimidobenzyl-N-hydroxy-succinimide ester
method. All of these methods are incorporated by reference
herein.

Also within the invention is a method of detecting 
TADG-12 protein in a biological sample, which includes the steps
of contacting the sample with the labeled antibody, e.g., 
radioactively tagged antibody specific for TADG-12, and 
determining whether the antibody binds to a component of the 
sample. 

As described herein, the invention provides a number
of diagnostic advantages and uses. For example, the TADG-12
protein disclosed in the present invention is useful in diagnosing
cancer in different tissues since this protein is highly
overexpressed in tumor cells. Antibodies (or antigen-binding
fragments thereof) which bind to an epitope specific for TADG-12
are useful in a method of detecting TADG-12 protein in a biological
sample for diagnosis of cancerous or neoplastic transformation.
This method includes the steps of obtaining a biological sample
(e.g., cells, blood, plasma, tissue, etc.) from a patient suspected of
having cancer, contacting the sample with a labeled antibody (e.g.,
radioactively tagged antibody) specific for TADG-12, and detecting
the TADG-12 protein using standard immunoassay techniques
such as an ELISA. Antibody binding to the biological sample
indicates that the sample contains a component which specifically
binds to an epitope within TADG-12.

Likewise, a standard Northern blot assay can be used
to ascertain the relative amounts of TADG-12 mRNA in a cell or
tissue obtained from a patient suspected of having cancer, in
accordance with conventional Northern hybridization techniques
known to those of ordinary skill in the art. This Northern assay
uses a hybridization probe, e.g. radiolabelled TADG-12 cDNA,
either containing the full-length, single stranded DNA having a
sequence complementary to SEQ ID No. 1 or SEQ ID No. 3, or a
fragment of that DNA sequence at least 20 (preferably at least 30,
more preferably at least 50, and most preferably at least 100)
consecutive nucleotides in length. The DNA hybridization probe
can be labeled by any of the many different methods known to
those skilled in this art.

Antibodies to the TADG-12 protein can be used in an
immunoassay to detect increased levels of TADG-12 protein
expression in tissues suspected of neoplastic transformation.
These same uses can be achieved with Northern blot assays and
analyses.

The present invention is directed to a DNA fragment
encoding a TADG-12 protein selected from the group consisting of:
(a) an isolated DNA fragment which encodes a TADG-12 protein;
(b) an isolated DNA fragment which hybridizes to the isolated DNA
fragment of (a) above and which encodes a TADG-12 protein; and
(c) an isolated DNA fragment differing from the isolated DNA
fragments of (a) and (b) above in codon sequence due to the
degeneracy of the genetic code, and which encodes a TADG-12
protein. Preferably, the DNA has the sequence shown in SEQ ID
No. 1 or SEQ ID No. 3. More preferably, the DNA encodes a TADG-12
protein having the amino acid sequence shown in SEQ ID No. 2
or SEQ ID No. 4.
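
The notion of DNA fragments that differ in codon sequence owing to the degeneracy of the genetic code, yet encode the same protein, can be illustrated with the standard genetic code. In the sketch below the two 15-nucleotide sequences are hypothetical codon choices for the pentapeptide WIHEQ and are not taken from SEQ ID No. 1 or SEQ ID No. 3; the function name is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA string, read in frame from its first base."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


variant_a = "TGGATTCATGAACAA"  # hypothetical codon choices
variant_b = "TGGATCCACGAGCAG"  # differs at four third-codon positions
differences = sum(a != b for a, b in zip(variant_a, variant_b))
print(translate(variant_a), translate(variant_b), differences)
```

Both hypothetical sequences translate to the same five residues even though they differ at four nucleotide positions.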

The present invention is also directed to a vector 
and/or a host cell capable of expressing the DNA of the present 
invention. Preferably, the vector contains DNA encoding a TADG-12
protein having the amino acid sequence shown in SEQ ID No. 2
or SEQ ID No. 4. Representative host cells include bacterial cells, 
yeast cells, mammalian cells and insect cells. 

The present invention is also directed to an isolated
and purified TADG-12 protein coded for by DNA selected from the
group consisting of: (a) isolated DNA which encodes a TADG-12
protein; (b) isolated DNA which hybridizes to isolated DNA of (a)
above and which encodes a TADG-12 protein; and (c) isolated DNA
differing from the isolated DNAs of (a) and (b) above in codon
sequence due to the degeneracy of the genetic code, and which
encodes a TADG-12 protein. Preferably, the isolated and purified
TADG-12 protein has the amino acid sequence shown in SEQ ID No.
2 or SEQ ID No. 4.

The present invention is also directed to a method of
detecting expression of the TADG-12 protein described herein,
comprising the steps of: (a) contacting mRNA obtained from a
cell with a labeled hybridization probe; and (b) detecting
hybridization of the probe with the mRNA.

A number of potential applications are possible for the
TADG-12 gene and gene product including the truncated product
TADG-12V.

In one embodiment of the present invention, there is
provided a method for diagnosing a cancer by detecting a TADG-12
protein in a biological sample, wherein the presence or absence
of a TADG-12 protein indicates the presence or absence of a
cancer. Preferably, the biological sample is selected from the
group consisting of blood, urine, saliva, tears, interstitial fluid,
ascites fluid, tumor tissue biopsy and circulating tumor cells. Still
preferably, the detection of TADG-12 protein is by means selected
from the group consisting of Northern blot, Western blot, PCR, dot
blot, ELISA sandwich assay, radioimmunoassay, DNA array chips
and flow cytometry. Such method is used for detecting an ovarian
cancer, breast cancer, lung cancer, colon cancer, prostate cancer
and other cancers in which TADG-12 is overexpressed.

In another embodiment of the present invention, there
is provided a method for detecting malignant hyperplasia by
detecting a TADG-12 protein or TADG-12 mRNA in a biological
sample. Further, by comparing the TADG-12 protein or TADG-12
mRNA to reference information, a diagnosis or a treatment can be
provided. Preferably, PCR amplification is used for detecting
TADG-12 mRNA, wherein the primers utilized are selected from
the group consisting of SEQ ID Nos. 28-31. Still preferably,
detection of a TADG-12 protein is by immunoaffinity to an
antibody directed against a TADG-12 protein.

In still another embodiment of the present invention, 
there is provided a method of inhibiting expression of endogenous 
TADG-12 mRNA in a cell by introducing a vector comprising a DNA 
fragment of TADG-12 in opposite orientation operably linked to
elements necessary for expression. As a result, the vector
produces TADG-12 antisense mRNA in the cell, which hybridizes to 
endogenous TADG-12 mRNA, thereby inhibiting expression of 
endogenous TADG-12 mRNA. 

In still yet another embodiment of the present
invention, there is provided a method of inhibiting expression of a
TADG-12 protein by introducing an antibody directed against a 
TADG-12 protein or fragment thereof. As a result, the binding of 
the antibody to the TADG-12 protein or fragment thereof inhibits 
the expression of the TADG-12 protein.

TADG-12 gene products including the truncated form
can be used for targeted therapy. Specifically, a compound having
a targeting moiety specific for a TADG-12 protein and a
therapeutic moiety is administered to an individual in need of
such treatment. Preferably, the targeting moiety is selected from
the group consisting of an antibody directed against a TADG-12
protein and a ligand or ligand binding domain that binds a TADG-12
protein. The TADG-12 protein has an amino acid sequence
shown in SEQ ID No. 2 or SEQ ID No. 4. Still preferably, the
therapeutic moiety is selected from the group consisting of a
radioisotope, a toxin, a chemotherapeutic agent, an immune
stimulant and a cytotoxic agent. Such method can be used for
treating an individual having a disease selected from the group
consisting of ovarian cancer, lung cancer, prostate cancer, colon
cancer and other cancers in which TADG-12 is overexpressed.

In yet another embodiment of the present invention,
there is provided a method of vaccinating, or producing an
immune response in, an individual against TADG-12 by inoculating
the individual with a TADG-12 protein or fragment thereof.
Specifically, the TADG-12 protein or fragment thereof lacks TADG-12
activity, and the inoculation elicits an immune response in the
individual, thereby vaccinating the individual against TADG-12.
Preferably, the individual has a cancer, is suspected of having a
cancer or is at risk of getting a cancer. Still preferably, TADG-12
protein has an amino acid sequence shown in SEQ ID No. 2 or SEQ
ID No. 4, while TADG-12 fragment has a sequence shown in SEQ ID
No. 8, or is a 9-residue fragment up to a 20-residue fragment.
Examples of 9-residue fragments are shown in SEQ ID Nos. 35, 36,
55, 56, 83, 84, 97, 98, 119, 120, 122, 123 and 136.

In still yet another embodiment of the present
invention, there is provided an immunogenic composition,
comprising an immunogenic fragment of a TADG-12 protein and
an appropriate adjuvant. Preferably, the immunogenic fragment
of the TADG-12 protein has a sequence shown in SEQ ID No. 8, or is
a 9-residue fragment up to a 20-residue fragment. Examples of
9-residue fragments are shown in SEQ ID Nos. 35, 36, 55, 56, 83, 84,
97, 98, 119, 120, 122, 123 and 136.
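
For illustration, candidate fragments of 9 to 20 residues can be enumerated from a protein sequence by a sliding window. The sketch below is an assumption about how such candidates might be listed, not the selection method used to arrive at the SEQ ID Nos. recited above. The 12-residue peptide of SEQ ID No. 34 is used only as a conveniently short input, and the windows produced are not asserted to correspond to any listed sequence.

```python
def peptide_windows(protein, min_len=9, max_len=20):
    """Yield every contiguous fragment of min_len to max_len residues."""
    for length in range(min_len, max_len + 1):
        for start in range(len(protein) - length + 1):
            yield protein[start:start + length]


peptide = "WIHEQMERDLKT"  # SEQ ID No. 34, used here only as a short input
nine_mers = list(peptide_windows(peptide, min_len=9, max_len=9))
all_fragments = list(peptide_windows(peptide))
print(nine_mers)
```

A 12-residue input yields four 9-residue windows; lengths greater than the input simply yield nothing.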

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not
meant to limit the present invention in any fashion. 



EXAMPLE 1

Tissue collection and storage 

Upon patient hysterectomy, bilateral salpingo-oophorectomy,
or surgical removal of neoplastic tissue, the
specimen was retrieved and placed on ice. The specimen was then
taken to the resident pathologist for isolation and identification of
specific tissue samples. Finally, the sample was frozen in liquid
nitrogen, logged into the laboratory record and stored at -80°C.
Additional specimens were frequently obtained from the
Cooperative Human Tissue Network (CHTN). These samples were
prepared by the CHTN and shipped on dry ice. Upon arrival, these
specimens were logged into the laboratory record and stored at
-80°C.

EXAMPLE 2

mRNA Extraction and cDNA Synthesis

Sixty-nine ovarian tumors (4 benign tumors, 10 low
malignant potential tumors and 55 carcinomas) and 10 normal
ovaries were obtained from surgical specimens and frozen in
liquid nitrogen. The human ovarian carcinoma cell lines SW626
and Caov 3 and the human breast carcinoma cell lines MDA-MB-231
and MDA-MB-435S were purchased from the American Type
Culture Collection (Rockville, MD). Cells were cultured to
sub-confluency in Dulbecco's modified Eagle's medium, supplemented
with 10% (v/v) fetal bovine serum and antibiotics.

Extraction of mRNA and cDNA synthesis were carried
out by the methods described previously [14-16]. mRNA was
isolated by using a RiboSep mRNA isolation kit (Becton Dickinson
Labware). In this procedure, poly A+ mRNA was isolated directly
from the tissue lysate using the affinity chromatography media
oligo(dT) cellulose. cDNA was synthesized with 5.0 µg of mRNA by
random hexamer priming using 1st strand cDNA synthesis kit
(CLONTECH).



EXAMPLE 3 

PCR with Redundant Primers and Cloning of TADG-12 cDNA 

Redundant primers, forward 5'-
TGGGTIGTIACIGCIGCICA(CT)TG-3' (SEQ ID No. 26) and reverse 5'-
A(AG)IA(AG)IGCIATITCITTICC-3' (SEQ ID No. 27), for the
consensus sequences of amino acids surrounding the catalytic
triad for serine proteases were used to compare the PCR products
from normal and carcinoma cDNAs. The appropriate bands were
ligated into Promega T-vector plasmid and the ligation product
was used to transform JM109 cells (Promega) grown on selection
media. After selection of individual colonies, they were cultured
and plasmid DNA was isolated by means of the Wizard miniprep
DNA purification system (Promega). Nucleotide sequencing was
performed using PRISM Ready Reaction Dye Deoxy terminator
cycle sequencing kit (Applied Biosystems). Applied Biosystems
Model 373A DNA sequencing system was used for direct cDNA
sequence determination.
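
As an illustration of the redundancy built into such primers, the sketch below expands the parenthesized alternatives in a primer into the individual oligonucleotides it represents. It assumes, as is conventional, that "I" denotes inosine (which pairs with any base and is therefore left unexpanded) and that a group such as "(CT)" denotes a mixed position; the function names are assumptions.

```python
import re
from itertools import product

# One token per primer position: a parenthesized mix such as "(CT)" or a single base.
TOKEN = re.compile(r"\([ACGT]+\)|[ACGTI]")


def parse_primer(primer):
    """Return the list of allowed bases at each primer position."""
    primer = primer.upper()
    tokens = TOKEN.findall(primer)
    if "".join(tokens) != primer:
        raise ValueError("unrecognized character in primer")
    return [token.strip("()") for token in tokens]


def expand(primer):
    """List every oligonucleotide represented by a redundant primer."""
    return ["".join(combo) for combo in product(*parse_primer(primer))]


forward = "TGGGTIGTIACIGCIGCICA(CT)TG"   # SEQ ID No. 26
reverse = "A(AG)IA(AG)IGCIATITCITTICC"   # SEQ ID No. 27
print(expand(forward))
print(len(parse_primer(reverse)), len(expand(reverse)), reverse.count("I"))
```

On this reading the forward primer is a 23-mer mixture of two oligonucleotides carrying five inosines, and the reverse primer is a 20-mer mixture of four oligonucleotides carrying six inosines.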

The original TADG-12 subclone was randomly labeled
and used as a probe to screen an ovarian tumor cDNA library by
standard hybridization techniques [11,15]. The library was
constructed in λZAP using mRNA isolated from the tumor cells of a
stage III/grade III ovarian adenocarcinoma patient. Three
overlapping clones were obtained which spanned 2315
nucleotides. The final 99 nucleotides encoding the most 3'
sequence including the poly A tail were identified by homology
with clones available in the GenBank EST database.



EXAMPLE 4

Quantitative PCR 

The mRNA overexpression of TADG-12 was
determined using a quantitative PCR. Quantitative PCR was
performed according to the procedure as previously reported [16].
Oligonucleotide primers were used for: TADG-12, forward 5'-
GAAACATGTCCTTGCTCTCG-3' (SEQ ID No. 28) and reverse 5'-
ACTAACTTCCACAGCCTCCT-3' (SEQ ID No. 29); the variant TADG-12,
forward 5'-TCCAGGTGGGTCTAGTTTCC-3' (SEQ ID No. 30), reverse
5'-CTCTTTGGCTTGTACTTGCT-3' (SEQ ID No. 31); β-tubulin, forward
5'-CGCATCAACGTGTACTACAA-3' (SEQ ID No. 32) and reverse 5'-
TACGAGCTGGTGGACTGAGA-3' (SEQ ID No. 33). β-tubulin was
utilized as an internal control. The PCR reaction mixture consisted
of cDNA derived from 50 ng of mRNA, 5 pmol of sense and
antisense primers for both the TADG-12 gene and the β-tubulin
gene, 200 µmol of dNTPs, 5 µCi of α-³²P dCTP and 0.25 unit of Taq
DNA polymerase with reaction buffer (Promega) in a final volume
of 25 µl. The target sequences were amplified in parallel with the
β-tubulin gene. Thirty cycles of PCR were carried out in a Thermal
Cycler (Perkin-Elmer Cetus). Each cycle of PCR included 30
seconds of denaturation at 94°C, 30 seconds of annealing at 60°C
and 30 seconds of extension at 72°C. The PCR products were
separated on 2% agarose gels and the radioactivity of each PCR
product was determined by using a PhosphorImager (Molecular
Dynamics). The present study used the expression ratio
(TADG-12/β-tubulin) as measured by PhosphorImager to evaluate gene
expression and defined the value at mean + 2SD of normal ovary
as the cut-off value to determine overexpression. Student's t
test was used for comparison of the mean values of normal ovary
and tumors.
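
The cut-off and comparison described above can be sketched as follows. All expression ratios in the sketch are hypothetical values supplied for illustration and are not data from this study. The use of the sample standard deviation and of a pooled-variance form of Student's t statistic are assumptions, since the text does not state which were applied.

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical TADG-12/beta-tubulin expression ratios, for illustration only.
normal_ovary = [0.10, 0.12, 0.08, 0.11, 0.09]
tumors = [0.35, 0.10, 0.42, 0.28, 0.51, 0.09]


def overexpression_cutoff(normal_ratios):
    """Cut-off defined as mean + 2SD of the normal samples (sample SD assumed)."""
    return mean(normal_ratios) + 2 * stdev(normal_ratios)


def student_t(group_a, group_b):
    """Two-sample Student's t statistic with pooled variance (assumed form)."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = ((n_a - 1) * stdev(group_a) ** 2 +
                  (n_b - 1) * stdev(group_b) ** 2) / (n_a + n_b - 2)
    return (mean(group_b) - mean(group_a)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))


cutoff = overexpression_cutoff(normal_ovary)
overexpressed = [ratio > cutoff for ratio in tumors]
t_statistic = student_t(normal_ovary, tumors)
print(round(cutoff, 4), sum(overexpressed), len(tumors))
```

With these hypothetical values, four of the six tumor ratios exceed the cut-off and would be scored as overexpressing.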



EXAMPLE 5 

Sequencing of TADG-12/TADG-12V

Utilizing a plasmid specific primer near the cloning
site, sequencing reactions were carried out using PRISM™ Ready
Reaction Dye Deoxy™ terminators (Applied Biosystems cat#
401384) according to the manufacturer's instructions. Residual
dye terminators were removed from the completed sequencing
reaction using a Centri-Sep™ spin column (Princeton Separation
cat.# CS-901). An Applied Biosystems Model 373A DNA
Sequencing System was used for sequence
analysis.

EXAMPLE 6

Antibody Production 

Polyclonal rabbit antibodies were generated by
immunization of white New Zealand rabbits with a poly-lysine
linked multiple antigen peptide derived from the TADG-12
carboxy-terminal protein sequence NH2-WIHEQMERDLKT-COOH
(WIHEQMERDLKT, SEQ ID No. 34). This peptide is present in full
length TADG-12, but not TADG-12V. Rabbits were immunized
with approximately 100 µg of peptide emulsified in Ribi adjuvant.
Subsequent boost immunizations were carried out at 3 and 6
weeks, and rabbit serum was isolated 10 days after the boost
inoculations. Sera were tested by dot blot analysis to determine
affinity for the TADG-12 specific peptide. Rabbit pre-immune
serum was used as a negative control.

EXAMPLE 7 

Northern Blot Analysis 

10 µg of mRNA were loaded onto a 1% formaldehyde-agarose
gel, electrophoresed and blotted on a Hybond-N+ nylon
membrane (Amersham). ³²P-labeled cDNA probes were made by
Prime-a-Gene Labeling System (Promega). The PCR products
amplified by the same primers as above were used for probes.
The blots were prehybridized for 30 min and hybridized for 60
min at 68°C with ³²P-labeled cDNA probe in ExpressHyb
Hybridization Solution (CLONTECH). Control hybridization to
determine relative gel loading was performed with the β-tubulin
probe.

Normal human tissues (spleen, thymus, prostate, testis,
ovary, small intestine, colon and peripheral blood leukocyte) and
normal human fetal tissues (brain, lung, liver and kidney) (Human
Multiple Tissue Northern Blot; CLONTECH) were also examined by
the same hybridization procedure.



EXAMPLE 8 

Immunohistochemistry 

Immunohistochemical staining was performed using a
Vectastain Elite ABC Kit (Vector). Formalin fixed and paraffin
embedded specimens were routinely deparaffinized and processed
using microwave heat treatment in 0.01 M sodium citrate buffer
(pH 6.0). The specimens were incubated with normal goat serum
in a moist chamber for 30 minutes. TADG-12 peptide antibody
was allowed to incubate with the specimens in a moist
chamber for 1 hour. Excess antibody was washed away with
phosphate buffered saline. After incubation with biotinylated
anti-rabbit IgG for 30 minutes, the sections were then incubated
with ABC reagent (Vector) for 30 minutes. The final products
were visualized using the AEC substrate system (DAKO) and
sections were counterstained with hematoxylin before mounting.
Negative controls were performed by using normal serum instead
of the primary antibody.

EXAMPLE 9

Isolation of Catalytic Domain Subclones of TADG-12 and TADG-12
Variant

To identify serine proteases that are expressed in
ovarian tumors, redundant PCR primers designed to the conserved
regions of the catalytic triad of these enzymes were employed. A
sense primer designed to the region surrounding the conserved
histidine and an anti-sense primer designed to the region
surrounding the conserved aspartate were used in PCR reactions
with either normal ovary or ovarian tumor cDNA as template. In
the reaction with ovarian tumor cDNA, a strong product band of
the expected size of approximately 180 bp was observed, as well
as an unexpected PCR product of approximately 300 bp which
showed strong expression in some ovarian tumor cDNAs (Figure
1A). Both of these PCR products were subcloned and sequenced.
The sequence of the subclones from the 180 bp band (SEQ ID No. 5)
was found to be homologous to the sequence identified in the
larger, unexpected band (SEQ ID No. 7), except that the larger band
had an additional insert of 133 nucleotides (Figure 1B). The
smaller product of the appropriate size encoded a protein
sequence (SEQ ID No. 6) homologous to other known proteases,
while the sequence with the insertion (SEQ ID No. 8) encoded a
frame shift from the serine protease catalytic domain and a
subsequent premature translational stop codon. TADG-12 variants
from four individual tumors were also subcloned and sequenced,
and the sequence and insert were found to be identical. The
genomic sequences for these cDNA-derived clones were amplified
by PCR, examined and found to contain potential AG/GT splice
sites that would allow for production of the variant transcript.
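
The frame shift reported for the variant follows from simple codon arithmetic: an insert whose length is not a multiple of three displaces the downstream reading frame. The short Python sketch below only illustrates that arithmetic for the 133-nucleotide insert; the function and variable names are illustrative and are not part of the original disclosure.

```python
def shifts_reading_frame(insert_length: int) -> bool:
    """Return True if an insertion of this length is not a whole number of codons."""
    return insert_length % 3 != 0


# Insert length reported for the larger PCR product in this example.
INSERT_LENGTH_NT = 133

# Whole codons and leftover nucleotides contributed by the insert.
whole_codons, leftover_nt = divmod(INSERT_LENGTH_NT, 3)

print(f"{INSERT_LENGTH_NT} nt = {whole_codons} codons + {leftover_nt} nt")
print("frame shift expected:", shifts_reading_frame(INSERT_LENGTH_NT))
```

Because one nucleotide is left over, every codon downstream of the insert is read out of frame, which is consistent with the premature stop codon described above.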

EXAMPLE 10

Northern Blot Analysis of TADG-12 Expression

To examine transcript size and tissue distribution, the
catalytic domain subclone was randomly labeled and used to
probe Northern blots representing normal ovarian tissue, ovarian
tumors and the cancer cell lines SW626, CAOV3, HeLa, MDA-MB-435S
and MDA-MB-231 (Figure 2). Three transcripts of 2.4, 1.6
and 0.7 kilobases were observed. In blots of normal ovary and
ovarian tumor, the smallest transcript (0.7 kb) was expressed at a
low level in normal ovary, while all three transcripts (2.4, 1.6 and
0.7 kb) were abundantly present in serous carcinoma. In addition,
Northern blots representing the normal human tissues spleen, thymus,
prostate, testis, ovary, small intestine, colon and peripheral blood
leukocyte, and normal human fetal tissues of brain, lung, liver and
kidney were examined. The same three transcripts were found to
be expressed weakly in all of these tissues (data not shown). A
human β-tubulin specific probe was utilized as a control for
relative sample loading. In addition, an RNA dot blot
representing 50 human tissues was probed, and this clone was
found to be weakly expressed in all tissues represented (Figure 3).
It was found most prominently in heart, with intermediate levels in
putamen, amygdala, kidney, liver, small intestine, skeletal muscle,
and adrenal gland.


EXAMPLE 11 

Sequencing and Characterization of TADG-12 

An ovarian tumor cDNA library constructed in λZAP
was screened by standard hybridization techniques using the
catalytic domain subclone as a probe. Two clones that overlapped
with the probe were identified and sequenced and found to
represent 2316 nucleotides. The 97 nucleotides at the 3' end of
the transcript, including the poly-adenylation signal and the
poly(A) tail, were identified by homology with clones available in
GenBank's EST database. This brought the total size of the
transcript to 2413 bases (SEQ ID No. 1, Figure 4). Subsequent
screening of GenBank's Genomic Database revealed that TADG-12
is homologous to a cosmid from chromosome 17. This cosmid has
the accession number AC015555.

The identified cDNA includes an open reading frame
that would produce a predicted protein of 454 amino acids (SEQ ID
No. 2), named Tumor Associated Differentially-Expressed Gene-12
(TADG-12). The sequence has been submitted to the GenBank
database and granted the accession number AF201380. Homology
alignment programs indicate that this protein contains several
domains, including an amino-terminal cytoplasmic domain, a
potential Type II transmembrane domain followed by a low-density
lipoprotein receptor-like class A domain (LDLR-A), a scavenger
receptor cysteine rich domain (SRCR), and an extracellular serine
protease domain.

As predicted by the TMPred program, TADG-12 contains
a highly hydrophobic stretch of amino acids that could serve as a
potential transmembrane domain, which would retain the amino
terminus of the protein within the cytoplasm and expose the
ligand binding domains and protease domain to the extracellular
space. This general structure is consistent with other known
transmembrane proteases, including hepsin [17] and TMPRSS2
[18], and TADG-12 is particularly similar in structure to the
TMPRSS2 protease.

The LDLR-A domain of TADG-12 is represented by the
sequence from amino acid 74 to 108 (SEQ ID No. 13). The LDLR-A
domain was originally identified within the LDL Receptor [19] as a
series of repeated sequences of approximately 40 amino acids
which contained 6 invariant cysteine residues and highly
conserved aspartate and glutamate residues. Since that initial
identification, a host of other genes have been identified which
contain motifs homologous to this domain [20]. Several proteases
have been identified which contain LDLR-A motifs, including
matriptase, TMPRSS2 and several complement components. A
comparison of TADG-12 with other known LDLR-A domains is
shown in Figure 5A. The similarity of these sequences ranges from
44 to 54% of similar or identical amino acids.

In addition to the LDLR-A domain, TADG-12 contains
another extracellular ligand binding domain with homology to the
group A SRCR family. This family of protein domains typically is
defined by the conservation of 6 cysteine residues within a
sequence of approximately 100 amino acids [23]. The SRCR
domain of TADG-12 is encoded by amino acids 109 to 206 (SEQ ID
No. 17), and this domain was aligned with other SRCR domains and
found to have between 36 and 43% similarity (Figure 5B).
However, TADG-12 has only 4 of the 6 conserved cysteine
residues. This is similar to the SRCR domain found in the protease
TMPRSS2.

The TADG-12 protein also includes a serine protease
domain of the trypsin family of proteases. An alignment of the
catalytic domain of TADG-12 with other known proteases is shown
in Figure 5C. The similarity among these sequences ranges from 48
to 55%, and TADG-12 is most similar to the serine protease
TMPRSS2, which also contains a transmembrane domain, an LDLR-A
domain and an SRCR domain. There is a conserved amino acid
motif (RIVGG) downstream from the SRCR domain that is a
potential cleavage/activation site common to many serine
proteases of this family [25]. This suggests that TADG-12 is
trafficked to the cell surface, where the ligand binding domains are
capable of interacting with extracellular molecules and the
protease domain is potentially activated. TADG-12 also contains
conserved cysteine residues (amino acids 208 and 243) which in
other proteases form a disulfide bond capable of linking the
activated protease to the other extracellular domains.
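
The domain boundaries quoted in this example can be collected into a small lookup to check them against the approximate domain sizes cited from the literature (about 40 residues for LDLR-A, about 100 for SRCR). The Python sketch below is illustrative only; the dictionary layout and helper name are assumptions, and only the residue numbers come from the text.

```python
# Residue boundaries (inclusive) as stated in the text for the 454-residue protein.
PROTEIN_LENGTH = 454
DOMAINS = {
    "LDLR-A": (74, 108),
    "SRCR": (109, 206),
}


def domain_length(start: int, end: int) -> int:
    """Number of residues in an inclusive residue range."""
    return end - start + 1


for name, (start, end) in DOMAINS.items():
    # Every domain must lie inside the predicted protein.
    assert 1 <= start <= end <= PROTEIN_LENGTH
    print(f"{name}: residues {start}-{end}, {domain_length(start, end)} aa")
```

Under these boundaries the LDLR-A domain spans 35 residues and the SRCR domain 98 residues, and the two domains are directly adjacent in the sequence.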



EXAMPLE 12 

Quantitative PCR Characterization of the Alternative Transcript

The original TADG-12 subclone was identified as
highly expressed in the initial redundant-primer PCR experiment.
The TADG-12 variant form (TADG-12V) with the insertion of 133
bp was also easily detected in the initial experiment. To identify
the frequency of this expression and whether or not the
expression level differed between normal ovary and ovarian
tumors, a previously authenticated semi-quantitative PCR
technique was employed [16]. The PCR analysis co-amplified a
product for β-tubulin with either a product specific to TADG-12 or
TADG-12V in the presence of a radiolabelled nucleotide. The
products were separated by agarose gel electrophoresis and a
phosphoimager was used to quantitate the relative abundance of
each PCR product. Examples of these PCR amplification products
are shown for both TADG-12 and TADG-12V in Figure 6. Normal
expression was defined as the mean ratio of TADG-12 (or
TADG-12V) to β-tubulin +/- 2SD as examined in normal ovarian samples.
For tumor samples, overexpression was defined as >2SD above the
mean normal TADG-12/β-tubulin or TADG-12V/β-tubulin ratio. The
results are summarized in Table 1 and Table 2. TADG-12 was
found to be overexpressed in 41 of 55 carcinomas examined, while
the variant form was present at aberrantly high levels in 8 of 22
carcinomas. As determined by Student's t test, these
differences were statistically significant (p < 0.05).
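
The overexpression criterion described above (a tumor ratio more than two standard deviations above the mean ratio in normal ovary) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' analysis: the ratio values are hypothetical placeholders, and the use of the sample standard deviation is an assumption, since the text does not say which estimator was used.

```python
from statistics import mean, stdev


def overexpression_cutoff(normal_ratios):
    """Mean of the normal target/beta-tubulin ratios plus two standard deviations.

    Assumption: sample standard deviation (statistics.stdev).
    """
    return mean(normal_ratios) + 2 * stdev(normal_ratios)


def is_overexpressed(tumor_ratio, normal_ratios):
    """True if the tumor ratio exceeds the mean + 2SD cutoff of normal ovary."""
    return tumor_ratio > overexpression_cutoff(normal_ratios)


# Hypothetical ratios for illustration only; these are not data from the study.
normal = [0.10, 0.12, 0.08, 0.11, 0.09]
tumors = [0.09, 0.25, 0.40]

print(f"cutoff = {overexpression_cutoff(normal):.4f}")
print([is_overexpressed(r, normal) for r in tumors])
```

With these placeholder values the first tumor falls within the normal range and the other two exceed the cutoff.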

TABLE 1

Frequency of Overexpression of TADG-12 in Ovarian Carcinoma

Histology Type            TADG-12 (%)

Normal                    0/16 (0%)
LMP-Serous                3/6 (50%)
LMP-Mucinous              0/4 (0%)
Serous Carcinoma          23/29 (79%)
Mucinous Carcinoma        7/12 (58%)
Endometrioid Carcinoma    8/8 (100%)
Clear Cell Carcinoma      3/6 (50%)
Benign Tumors             3/4 (75%)

Overexpression = more than two standard deviations above
the mean for normal ovary
LMP = low malignant potential tumor

TABLE 2

Frequency of Overexpression of TADG-12V in Ovarian Carcinoma

Histology Type            TADG-12V (%)

Normal                    0/10 (0%)
LMP-Serous                0/5 (0%)
LMP-Mucinous              0/3 (0%)
Serous Carcinoma          4/14 (29%)
Mucinous Carcinoma        3/5 (60%)
Endometrioid Carcinoma    1/3 (33%)
Clear Cell Carcinoma      N/D

Overexpression = more than two standard deviations above
the mean for normal ovary; LMP = low malignant potential tumor
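
The pooled carcinoma frequencies quoted in Example 12 (41 of 55 for TADG-12 and 8 of 22 for TADG-12V) can be reproduced by summing the carcinoma rows of Tables 1 and 2. The Python sketch below is only a bookkeeping check; the dictionary and function names are illustrative, and the counts are copied from the two tables.

```python
# (overexpressing cases, cases examined) for the carcinoma rows of each table.
TABLE_1_CARCINOMAS = {
    "Serous Carcinoma": (23, 29),
    "Mucinous Carcinoma": (7, 12),
    "Endometrioid Carcinoma": (8, 8),
    "Clear Cell Carcinoma": (3, 6),
}
TABLE_2_CARCINOMAS = {
    "Serous Carcinoma": (4, 14),
    "Mucinous Carcinoma": (3, 5),
    "Endometrioid Carcinoma": (1, 3),
}


def pooled(counts):
    """Sum overexpressing cases and examined cases across histology types."""
    over = sum(o for o, _ in counts.values())
    total = sum(n for _, n in counts.values())
    return over, total


for label, table in (("TADG-12", TABLE_1_CARCINOMAS), ("TADG-12V", TABLE_2_CARCINOMAS)):
    over, total = pooled(table)
    print(f"{label}: {over}/{total} carcinomas ({100 * over / total:.0f}%)")
```

LMP tumors, benign tumors and normal ovary are left out of the pooled figures, which matches the totals given in the text.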

EXAMPLE 13 

Immunohistochemical Analysis of TADG-12 in Ovarian Tumor Cells 
In order to examine the TADG-12 protein, polyclonal
rabbit anti-sera to a peptide located in the carboxy-terminal
amino acid sequence were developed. These antibodies were used
to examine the expression level of the TADG-12 protein and its
localization within normal ovary and ovarian tumor cells by
immuno-localization. No staining was observed in normal ovarian
tissues (Figure 7A), while significant staining was observed in 22
of 29 tumors studied. Representative tumor samples are shown in
Figures 7B and 7C. It should be noted that TADG-12 is found in a
diffuse pattern throughout the cytoplasm, indicative of a protein in
a trafficking pathway. TADG-12 is also found at the cell surface in
these tumor samples, as expected. It should be noted that the


antibody developed and used for immunohistochemical analysis 
would not detect the TADG-12V truncated protein. 

The results of the immunohistochemical staining are
summarized in Table 3. Of the ovarian tumors, 22 of 29 showed
positive staining for TADG-12, whereas normal ovarian surface
epithelium showed no expression of the TADG-12 antigen. Positive
staining was seen in 8 of 10 serous adenocarcinomas, 8 of 8
mucinous adenocarcinomas, 1 of 2 clear cell carcinomas, and 4 of
6 endometrioid carcinomas.

TABLE 3

Case  Stage  Histology         Grade  LN*  TADG12  Prognosis

1            Normal ovary                  0-
2            Normal ovary                  0-
3            Normal ovary                  0-
4            Mucinous B               ND   0-      Alive
5            Mucinous B               ND   1+      Alive
6     1a     Serous LMP        G1     ND   1+      Alive
7     1a     Mucinous LMP      G1     ND   1+      Alive
8     1a     Mucinous CA       G1     ND   1+      Alive
9     1a     Mucinous CA       G2     ND   1+      Alive
10    1a     Endometrioid CA   G1     ND   0-      Alive
11    1c     Serous CA         G1     N    1+      Alive
12    1c     Mucinous CA       G1     N    1+      Alive
13    1c     Mucinous CA       G1     N    2+      Alive
14    1c     Clear cell CA     G2     N    0-      Alive
15    1c     Clear cell CA     G2     N    0-      Alive
16    2c     Serous CA         G3     N    2+      Alive
17    3a     Mucinous CA       G2     N    2+      Alive
18    3b     Serous CA         G1     ND   1+      Alive
19    3c     Serous CA         G1     N    0-      Dead
20    3c     Serous CA         G3     P    1+      Alive
21    3c     Serous CA         G2     P    2+      Alive
22    3c     Serous CA         G1     P    2+      Unknown
23    3c     Serous CA         G3     ND   2+      Alive
24    3c     Serous CA         G2     N    0-      Dead
25    3c     Mucinous CA       G1     P    2+      Dead
26    3c     Mucinous CA       G2     ND   1+      Unknown
27    3c     Mucinous CA       G2     N    1+      Alive
28    3c     Endometrioid CA   G1     P    1+
29    3c     Endometrioid CA   G2     N    0-      Alive
30    3c     Endometrioid CA   G2     P    1+      Dead
31    3c     Endometrioid CA   G3     P    1+      Alive
32    3c     Clear Cell CA     G3     P    2+      Dead

LN* = Lymph Node; B = Benign; N = Negative; P = Positive;
ND = Not Done



EXAMPLE 14

Peptide Ranking 

For vaccine or immune stimulation, individual 9-mers
to 11-mers of the TADG-12 protein were examined to rank the
binding of individual peptides to the top 8 haplotypes in the
general population [Parker et al., (1994)]. The computer program
used for this analysis can be found at
<http://www-bimas.dcrt.nih.gov/molbio/hla_bind/>. Table 4 shows the
peptide ranking based upon the predicted half-life of each peptide's
binding to a particular HLA allele. A larger half-life indicates a




stronger association between that peptide and the particular HLA
molecule. The TADG-12 peptides that strongly bind to an HLA
allele are putative immunogens, and are used to inoculate an
individual against TADG-12.
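
The first step of such a ranking, listing every 9-mer of the protein together with its start position, can be sketched as a sliding window. The Python example below covers only this enumeration step and does not reproduce the binding half-life scoring of the program cited above. The function name is illustrative, and the 13-residue fragment is assembled from overlapping Table 4 entries (start positions 259, 262 and 263) rather than taken from the full sequence listing.

```python
def enumerate_peptides(fragment: str, first_position: int, length: int = 9):
    """List (start position, peptide) for every window of the given length.

    first_position is the 1-based residue number of the fragment's first residue.
    """
    return [
        (first_position + i, fragment[i:i + length])
        for i in range(len(fragment) - length + 1)
    ]


# Fragment spanning residues 259-271, assembled from overlapping Table 4 peptides.
FRAGMENT = "CVYDLYLPKSWTI"
FIRST_POSITION = 259

peptides = enumerate_peptides(FRAGMENT, FIRST_POSITION)
for start, peptide in peptides:
    print(start, peptide)
```

Each candidate peptide produced this way would then be scored separately for each HLA allele, and the highest-scoring peptides retained, as in Table 4.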



TABLE 4

TADG-12 peptide ranking

HLA Type
& Ranking  Start  Peptide    Predicted Dissociation 1/2  SEQ ID No.

HLA A0201
1          40     ILSLLPFEV  685.783                     35
2          144    AQLGFPSYV  545.316                     36
3          225    LLSQWPWQA  63.342                      37
4          252    WIITAAHCV  43.992                      38
5          356    VLNHAAVPL  36.316                      39
6          176    LLPDDKVTA  34.627                      40
7          13     FSFRSLFGL  31.661                      41
8          151    YVSSDNLRV  27.995                      42
9          436    RVTSFLDWI  21.502                      43
10         234    SLQFQGYHL  21.362                      44
11         181    KVTALHHSV  21.300                      45
12         183    TALHHSVYV  19.658                      46
13         411    RLWKLVGAT  18.494                      47
14         60     LILALAIGL  18.476                      48
15         227    SQWPWQASL  17.977                      49
16         301    RLGNDIALM  11.426                      50
17         307    ALMKLAGPL  10.275                      51
18         262    DLYLPKSWT  9.837                       52
19         416    LVGATSFGI  9.001                       53
20         54     SLGIIALIL  8.759                       54

HLA A0205
1          218    IVGGNMSLL  47.600                      55
2          60     LILALAIGL  35.700                      48
3          35     AVAAQILSL  28.000                      56
4          307    ALMKLAGPL  21.000                      51
5          271    IQVGLVSLL  19.040                      57
6          397    CQGDSGGPL  16.800                      58
7          227    SQWPWQASL  16.800                      49
8          270    TIQVGLVSL  14.000                      59
9          56     GIIALILAL  14.000                      60
10         110    RVGGQNAVL  14.000                      61
11         181    KVTALHHSV  12.000                      45
12         151    YVSSDNLRV  12.000                      42
13         356    VLNHAAVPL  11.900                      39
14         144    AQLGFPSYV  9.600                       36
15         13     FSFRSLFGL  7.560                       41
16         54     SLGIIALIL  7.000                       54
17         234    SLQFQGYHL  7.000                       44
18         217    RIVGGNMSL  7.000                       62
19         411    RLWKLVGAT  6.000                       47
20         252    WIITAAHCV  6.000                       38

HLA A1
1          130    CSDDWKGHY  37.500                      63
2          8      AVEAPFSFR  9.000                       64
3          328    NSEENFPDG  2.700                       65
4          3      ENDPPAVEA  2.500                       66
5          98     DCKDGEDEY  2.500                       67
6          346    ATEDGGDAS  2.250                       68
7          360    AAVPLISNK  2.000                       69
8          153    SSDNLRVSS  1.500                       70
9          182    VTALHHSVY  1.250                       71
10         143    CAQLGFPSY  1.000                       72
11         259    CVYDLYLPK  1.000                       73
12         369    ICNHRDVYG  1.000                       74
13         278    LLDNPAPSH  1.000                       75
14         426    CAEVNKPGV  1.000                       76
15         32     DADAVAAQI  1.000                       77
16         406    VCQERRLWK  1.000                       78
17         329    SEENFPDGK  0.900                       79
18         303    GNDIALMKL  0.625                       80
19         127    KTMCSDDWK  0.500                       81
20         440    FLDWIHEQM  0.500                       82

HLA A24
1          433    VYTRVTSFL  280.000                     83
2          263    LYLPKSWTI  90.000                      84
3          169    EFVSIDHLL  42.000                      85
4          217    RIVGGNMSL  12.000                      62
5          296    KYKPKRLGN  12.000                      86
6          16     RSLFGLDDL  12.000                      87
7          267    KSWTIQVGL  11.200                      88
8          81     RSSFKCIEL  8.800                       89
9          375    VYGGIISPS  8.000                       90
10         110    RVGGQNAVL  8.000                       91
11         189    VYVREGCAS  7.500                       92
12         60     LILALAIGL  7.200                       48
13         165    QFREEFVSI  7.200                       93
14         271    IQVGLVSLL  7.200                       57
15         56     GIIALILAL  7.200                       60
YANVACAQL
CASGHWTL
SSRIVGGNM
KPKRLGNDI
GPLTFNEMI
CVRVGGQNA
HSKYKPKRL
RDVYGGIIS

7.200
7.200
6.000
6.000
200.000
80.000
80.000
40.000
24.000
20.000
20.000
12.000
12.000
12.000
12.000
12.000
10.00
8.000
8.000
5.000

51
95
39
96
97
98
106
111

19         304    NDIALMKLA  3.750                       152
20         104    DEYRCVRVG  3.600                       153



Conclusion

In this study, a serine protease was identified by
means of a PCR-based strategy. By Northern blot, the largest
transcript for this gene is approximately 2.4 kb, and it is
expressed at high levels in ovarian tumors while found at
minimal levels in all other tissues examined. The full-length cDNA
encoding a novel multi-domain, cell-surface serine protease was
cloned and named TADG-12. The 454 amino acid protein contains a
cytoplasmic domain, a type II transmembrane domain, an LDLR-A
domain, an SRCR domain and a serine protease domain. Using a
semi-quantitative PCR analysis, it was shown that TADG-12 is
overexpressed in a majority of tumors studied.
Immunohistochemical staining corroborates that in some cases
this protein is localized to the cell surface of tumor cells, which
suggests that TADG-12 has some extracellular proteolytic
functions. Interestingly, TADG-12 also has a variant splicing form
that is present in 35% of the tumors studied. This variant mRNA
would lead to a truncated protein that may provide a unique
peptide sequence on the surface of tumor cells.

This protein contains two extracellular domains which
might confer unusual properties to this multidomain molecule.
Although the precise role of LDLR-A function with regard to
proteases remains unclear, this domain certainly has the capacity
to bind calcium and other positively charged ligands [21,22]. This
may play an important role in the regulation of the protease or
subsequent internalization of the molecule. The SRCR domain was
originally identified within the macrophage scavenger receptor
and functionally described to bind lipoproteins. Not only are SRCR
domains capable of binding lipoproteins, but they may also bind to
molecules as diverse as polynucleotides [23]. More recent studies
have identified members of this domain family in proteins with
functions that vary from proteases to cell adhesion molecules
involved in maturation of the immune system [24]. In addition,
TADG-12, like TMPRSS2, has only four of six cysteine residues
conserved within its SRCR domain. This difference may allow for
different structural features of these domains that confer unusual
ligand binding properties. At this time, only the function of the
CD6-encoded SRCR is well documented. In the case of CD6, the
SRCR domain binds to the cell adhesion molecule ALCAM [23].
This mediation of cell adhesion is a useful starting point for future
research on newly identified SRCR domains. However, the
possibility of multiple functions for this domain cannot be
overlooked. SRCR domains are certainly capable of cell adhesion
type interactions, but their capacity to bind other types of ligands
should be considered.

At this time, the precise role of TADG-12 remains
unclear. Substrates have not been identified for the protease
domain, nor have ligands been identified for the extracellular
LDLR-A and SRCR domains. Figure 8 presents a working model of
TADG-12 with the information disclosed in the present invention.
Two transcripts are produced which lead to the production of
either TADG-12 or the truncated TADG-12V proteins. Either of
these proteins is potentially targeted to the cell surface. TADG-12
is capable of becoming an activated serine protease, while
TADG-12V is a truncated protein product that, if at the cell surface,
may represent a tumor-specific epitope.

The problem with treatment of ovarian cancer today
remains the inability to diagnose the disease at an early stage.
Identifying genes that are expressed early in the disease process,
such as proteases that are essential for tumor cell growth [26], is
an important step toward improving treatment. With this
knowledge, it may be possible to design assays to detect the
highly expressed genes, such as the TADG-12 protease described
here or previously described proteases, to diagnose these cancers
at an earlier stage. Panels of markers may also provide prognostic
information and could lead to therapeutic strategies for individual
patients. Alternatively, inhibition of enzymes such as proteases
may be an effective means for slowing progression of ovarian
cancer and improving the quality of patient life. Other features of
TADG-12 and TADG-12V must also be considered important to future
research. The extracellular ligand binding domains are natural
targets for drug delivery systems. The aberrant peptide
associated with the TADG-12V protein may provide an excellent
target for drug delivery or for immune stimulation.

Any patents or publications mentioned in this
specification are indicative of the levels of those skilled in the art
to which the invention pertains. These patents and publications
are herein incorporated by reference to the same extent as if each
individual publication was specifically and individually indicated
to be incorporated by reference.

One skilled in the art will readily appreciate that the
present invention is well adapted to carry out the objects and
obtain the ends and advantages mentioned, as well as those
inherent therein. The present examples along with the methods,
procedures, treatments, molecules, and specific compounds
described herein are presently representative of preferred
embodiments, are exemplary, and are not intended as limitations
on the scope of the invention. Changes therein and other uses will
occur to those skilled in the art which are encompassed within the
spirit of the invention as defined by the scope of the claims.

WHAT IS CLAIMED IS:

1. A DNA fragment encoding Tumor Associated
Differentially-Expressed Gene-12 (TADG-12) protein selected from
the group consisting of:

(a) an isolated DNA fragment which encodes a
TADG-12 protein;

(b) an isolated DNA fragment which hybridizes to
isolated DNA fragment of (a) above and which encodes a TADG-12
protein; and

(c) an isolated DNA fragment differing from the
isolated DNA fragments of (a) and (b) above in codon sequence
due to the degeneracy of the genetic code, and which encodes a
TADG-12 protein.

2. The DNA fragment of claim 1, wherein said DNA
fragment has the sequence selected from the group consisting of
SEQ ID No. 1 and SEQ ID No. 3.

3. The DNA fragment of claim 1, wherein said
TADG-12 protein has the amino acid sequence selected from the
group consisting of SEQ ID No. 2 and SEQ ID No. 4.

4. A vector comprising the DNA fragment of claim 1
and regulatory elements necessary for expression of the DNA in a
cell.

5. The vector of claim 4, wherein said DNA
fragment encodes a TADG-12 protein having the amino acid
sequence selected from the group consisting of SEQ ID No. 2 and
SEQ ID No. 4.

6. A host cell transfected with the vector of claim 4,
said vector expressing a TADG-12 protein.

7. The host cell of claim 6, wherein said cell is
selected from the group consisting of a bacterial cell, a mammalian
cell, a plant cell and an insect cell.

8. The host cell of claim 7, wherein said bacterial
cell is E. coli.

9. An antisense oligonucleotide directed against the
DNA fragment of claim 1.

10. An isolated and purified TADG-12 protein coded
for by DNA selected from the group consisting of:

(a) isolated DNA which encodes a TADG-12 protein;

(b) isolated DNA which hybridizes to isolated DNA of
(a) above and which encodes a TADG-12 protein; and

(c) isolated DNA differing from the isolated DNAs of
(a) and (b) above in codon sequence due to the degeneracy of the
genetic code, and which encodes a TADG-12 protein.

11. The isolated and purified TADG-12 protein of
claim 10, wherein said TADG-12 protein has an amino acid
sequence selected from the group consisting of SEQ ID No. 2 and
SEQ ID No. 4.

12. A method for detecting expression of the TADG-12
protein of claim 10, comprising the steps of:

(a) contacting mRNA obtained from a cell with a
labeled hybridization probe; and

(b) detecting hybridization of the probe with the
mRNA.

13. An antibody directed against the TADG-12
protein of claim 10.

14. A method for diagnosing a cancer in an
individual, comprising the steps of:

(a) obtaining a biological sample from said
individual; and

(b) detecting a TADG-12 protein in said sample,
wherein the presence of a TADG-12 protein in said sample is
indicative of the presence of a cancer in said individual, wherein
the absence of a TADG-12 protein in said sample is indicative of
the absence of a cancer in said individual.

15. The method of claim 14, wherein said biological
sample is selected from the group consisting of blood, urine, saliva,
tears, interstitial fluid, ascites fluid, tumor tissue biopsy and
circulating tumor cells.

16. The method of claim 14, wherein said detection
of a TADG-12 protein is by means selected from the group
consisting of Northern blot, Western blot, PCR, dot blot, ELISA
sandwich assay, radioimmunoassay, DNA array chips and flow
cytometry.



17. The method of claim 14, wherein said cancer is
selected from the group consisting of ovarian cancer, breast
cancer, lung cancer, colon cancer, prostate cancer and other
cancers in which TADG-12 is overexpressed.

18. A method for detecting malignant hyperplasia in
a biological sample, comprising the steps of:

(a) isolating mRNA from said sample; and

(b) detecting TADG-12 mRNA in said sample,
wherein the presence of said TADG-12 mRNA in said sample is
indicative of the presence of malignant hyperplasia, wherein the
absence of said TADG-12 mRNA in said sample is indicative of the
absence of malignant hyperplasia.

19. The method of claim 18, further comprising the
step of comparing said TADG-12 mRNA to reference information,
wherein said comparison provides a diagnosis of said malignant
hyperplasia.

20. The method of claim 18, further comprising the
step of comparing said TADG-12 mRNA to reference information,
wherein said comparison determines a treatment of said
malignant hyperplasia.

21. The method of claim 18, wherein said detection
of TADG-12 mRNA is by PCR amplification.

22. The method of claim 21, wherein said PCR
amplification uses primers selected from the group consisting of
SEQ ID Nos. 28-31.

23. The method of claim 18, wherein said biological
sample is selected from the group consisting of blood, urine, saliva,
tears, interstitial fluid, ascites fluid, tumor tissue biopsy and
circulating tumor cells.



24. A method for detecting malignant hyperplasia in
a biological sample, comprising the steps of:

(a) isolating protein from said sample; and

(b) detecting a TADG-12 protein in said sample,
wherein the presence of a TADG-12 protein in said sample is
indicative of the presence of malignant hyperplasia, wherein the
absence of a TADG-12 protein in said sample is indicative of the
absence of malignant hyperplasia.

25. The method of claim 24, further comprising the
step of comparing said TADG-12 protein to reference information,
wherein said comparison provides a diagnosis of said malignant
hyperplasia.

26. The method of claim 24, further comprising the
step of comparing said TADG-12 protein to reference information,
wherein said comparison determines a treatment of said
malignant hyperplasia.

27. The method of claim 24, wherein said detection
is by immunoaffinity to an antibody, wherein said antibody is
directed against a TADG-12 protein.

28. The method of claim 24, wherein said biological
sample is selected from the group consisting of blood, urine, saliva,
tears, interstitial fluid, ascites fluid, tumor tissue biopsy and
circulating tumor cells.

29. A method of inhibiting expression of endogenous
TADG-12 mRNA in a cell, comprising the step of:

introducing a vector into a cell, wherein said vector
comprises a DNA fragment of TADG-12 in opposite orientation
operably linked to elements necessary for expression, wherein
expression of said vector in said cell produces TADG-12 antisense
mRNA, wherein said TADG-12 antisense mRNA hybridizes to
endogenous TADG-12 mRNA, thereby inhibiting expression of
endogenous TADG-12 mRNA in said cell.

30. A method of inhibiting expression of a TADG-12
protein in a cell, comprising the step of:

introducing an antibody into a cell, wherein said
antibody is directed against a TADG-12 protein or fragment
thereof, wherein binding of said antibody to said TADG-12 protein
or fragment thereof inhibits expression of said TADG-12 protein.

31. A method of targeted therapy to an individual,
comprising the step of:

administering a compound to an individual, wherein
said compound has a targeting moiety and a therapeutic moiety,
wherein said targeting moiety is specific for a TADG-12 protein.

32. The method of claim 31, wherein said targeting
moiety is selected from the group consisting of an antibody
directed against a TADG-12 protein and a ligand or ligand binding
domain that binds a TADG-12 protein.

33. The method of claim 32, wherein said TADG-12
protein has an amino acid sequence selected from the group
consisting of SEQ ID No. 2 and SEQ ID No. 4.

34. The method of claim 31, wherein said
therapeutic moiety is selected from the group consisting of a
radioisotope, a toxin, a chemotherapeutic agent, an immune
stimulant and a cytotoxic agent.

35. The method of claim 31, wherein said individual
suffers from a disease selected from the group consisting of
ovarian cancer, lung cancer, prostate cancer, colon cancer and
other cancers in which TADG-12 is overexpressed.

36. A method of vaccinating an individual against
TADG-12, comprising the step of inoculating the individual with a
TADG-12 protein or fragment thereof, wherein said TADG-12
protein or fragment thereof lacks TADG-12 activity, wherein said
inoculation with said TADG-12 protein or fragment thereof elicits
an immune response in said individual, thereby vaccinating said
individual against TADG-12.

37. The method of claim 36, wherein said individual
has a cancer, is suspected of having a cancer or is at risk of getting
a cancer.

38. The method of claim 36, wherein said TADG-12
protein has an amino acid sequence selected from the group
consisting of SEQ ID No. 2 and SEQ ID No. 4.

39. The method of claim 36, wherein said TADG-12 
fragment has a sequence shown in SEQ ID No. 8. 

40. The method of claim 36, wherein said TADG-12
fragment is a 9-residue fragment selected from the group
consisting of SEQ ID Nos. 35, 36, 55, 56, 83, 84, 97, 98, 119, 120,
122, 123 and 136.

41. An immunogenic composition, comprising an
immunogenic fragment of a TADG-12 protein and an appropriate
adjuvant.

42. The immunogenic composition of claim 41,
wherein said immunogenic fragment of a TADG-12 protein has a
sequence shown in SEQ ID No. 8.

43. The immunogenic composition of claim 41,
wherein said immunogenic fragment of a TADG-12 protein is a
9-residue fragment selected from the group consisting of SEQ ID Nos.
35, 36, 55, 56, 83, 84, 97, 98, 119, 120, 122, 123 and 136.

FIG. 1A

TADG12

  1 TGGGTGGTGACGGCGGCGCACTGTGTTTATGACTTGTACCTCCCCAAGTCATGGACCATC
    W V V T A A (H) C V Y D L Y L P K S W T I

 61 CAGGTGGGTCTAGTTTCCCTGTTGGACAATCCAGCCCCATCCCACTTGGTGGAGAAGATT
    Q V G L V S L L D N P A P S H L V E K I

121 GTCTACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAACGACATCGCCCTCCTA (SEQ ID NO. 5)
    V Y H S K Y K P K R L G N (D) I A L L (SEQ ID NO. 6)

TADG12-V

  1 GGGTGGTGACGGCGGCGCACTGTGTTTATG AGATTGTAGCTCCTAGAGAAAGGGCAGACA
    VVTAAHCVYE IVAPRERADR

 61 GAAGAGGAAGGAAGCTCCTGTGCTGGAGGAAACCCACAAAAATGAAAGGACCTAGACCTT
    RGRKLLCWRKPTKMKGPRPS

121 CCCATAGCTAATTCCAGTGGACCATGTTATGGCAGATACAGG CTTGTACCTCCCCAAGTC
    HS* (SEQ ID NO. 8)

181 ATGGACCATCCAGGTGGGTCTAGTTTCCCTGTTGGACAATCCAGCCCCATCCCACTTGGT
241 GGAGAAGATTGTCTACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAACGACATCGCCCT
301 CCTAATCACTAGTGCGGCCGCCTGCAGG (SEQ ID NO. 7)

Array key (rows A-H, columns 1-8):

A: 1 whole brain; 2 amygdala; 3 caudate nucleus; 4 cerebellum; 5 cerebral cortex; 6 frontal lobe; 7 hippocampus; 8 medulla oblongata
B: 1 occipital lobe; 2 putamen; 3 substantia nigra; 4 temporal lobe; 5 thalamus; 6 subthalamic nucleus; 7 spinal cord
C: 1 heart; 2 aorta; 3 skeletal muscle; 4 colon; 5 bladder; 6 uterus; 7 prostate; 8 stomach
D: 1 testis; 2 ovary; 3 pancreas; 4 pituitary gland; 5 adrenal gland; 6 thyroid gland; 7 salivary gland; 8 mammary gland
E: 1 kidney; 2 liver; 3 small intestine; 4 spleen; 5 thymus; 6 peripheral leukocyte; 7 lymph node; 8 bone marrow
F: 1 appendix; 2 lung; 3 trachea; 4 placenta
G: 1 fetal brain; 2 fetal heart; 3 fetal kidney; 4 fetal liver; 5 fetal spleen; 6 fetal thymus; 7 fetal lung
H: 1 yeast total RNA, 100 ng; 2 yeast tRNA, 100 ng; 3 E. coli rRNA, 100 ng; 4 E. coli DNA, 100 ng; 5 poly r(A), 100 ng; 6 human Cot1 DNA, 100 ng; 7 human DNA, 100 ng; 8 human DNA, 500 ng

FIG. 3



WO 00/52044

PCT/US00/05612



   1 CGGGAAAGGGCTGTGTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGG
  61 TGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATG
 121 TCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCT
     MGENDPPAVEAPF 13
 181 TCTCATTCCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATG
     SFRSLFGLDDLKISPVAPDA 33
 241 CAGATGCTGTTGCTGCACAGATCCTGTCACTGCTGCCATTTGAAGTTTTTTCCCAATCAT
     DAVAAQILSLLPFEVFSQSS 53
 301 CGTCATTGGGGATCATTGCATTGATATTAGCACTGGCCATTGGTCTGGGCATCCACTTCG
     SLGIIALILALAIGLGIHFD 73
 361 ACTGCTCAGGGAAGTACAGATGTCGCTCATCCTTTAAGTGTATCGAGCTGATAACTCGAT
     CSGKYRCRSSFKCIELITRC 93
 421 GTGACGGAGTCTCGGATTGCAAAGACGGGGAGGACGAGTACCGCTGTGTCCGGGTGGGTG
     DGVSDCKDGEDEYRCVRVGG 113
 481 GTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCTTCGTGGAAGACCATGTGCTCCGATG
     QNAVLQVFTAASWKTMCSDD 133
 541 ACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCCCAACTGGGTTTCCCAAGCTATGTGA
     WKGHYANVACAQLGFPSYVS 153
 601 GTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGGCAGTTCCGGGAGGAGTTTGTGTCCA
     SDNLRVSSLEGQFREEFVSI 173
 661 TCGATCACCTCTTGCCAGATGACAAGGTGACTGCATTACACCACTCAGTATATGTGAGGG
     DHLLPDDKVTALHHSVYVRE 193
 721 AGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAGTGCACAGCCTGTGGTCATAGAAGGG
     GCASGHVVTLQCTACGHRRG 213
 781 GCTACAGCTCACGCATCGTGGGTGGAAACATGTCCTTGCTCTCGCAGTGGCCCTGGCAGG
     YSSRIVGGNMSLLSQWPWQA 233
 841 CCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGGGGCTCTGTCATCACGCCCCTGTGGA
     SLQFQGYHLCGGSVITPLWI 253
 901 TCATCACTGCTGCACACTGTGTTTATGACTTGTACCTCCCCAAGTCATGGACCATCCAGG
     ITAAHCVYDLYLPKSWTIQV 273
 961 TGGGTCTAGTTTCCCTGTTGGACAATCCAGCCCCATCCCACTTGGTGGAGAAGATTGTCT
     GLVSLLDNPAPSHLVEKIVY 293
1021 ACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAATGACATCGCCCTTATGAAGCTGGCCG
     HSKYKPKRLGNDIALMKLAG 313
1081 GGCCACTCACGTTCAATGAAATGATCCAGCCTGTGTGCCTGCCCAACTCTGAAGAGAACT
     PLTFNEMIQPVCLPNSEENF 333
1141 TCCCCGATGGAAAAGTGTGCTGGACGTCAGGATGGGGGGCCACAGAGGATGGAGGTGACG
     PDGKVCWTSGWGATEDGGDA 353
1201 CCTCCCCTGTCCTGAACCACGCGGCCGTCCCTTTGATTTCCAACAAGATCTGCAACCACA
     SPVLNHAAVPLISNKICNHR 373
1261 GGGACGTGTACGGTGGCATCATCTCCCCCTCCATGCTCTGCGCGGGCTACCTGACGGGTG
     DVYGGIISPSMLCAGYLTGG 393
1321 GCGTGGACAGCTGCCAGGGGGACAGCGGGGGGCCCCTGGTGTGTCAAGAGAGGAGGCTGT
     VDSCQGDSGGPLVCQERRLW 413
1381 GGAAGTTAGTGGGAGCGACCAGCTTTGGCATCGGCTGCGCAGAGGTGAACAAGCCTGGGG
     KLVGATSFGIGCAEVNKPGV 433
1441 TGTACACCCGTGTCACCTCCTTCCTGGACTGGATCCACGAGCAGATGGAGAGAGACCTAA
     YTRVTSFLDWIHEQMERDLK 453
1501 AAACCTGAAGAGGAAGGGGACAAGTAGCCACCTGAGTTCCTGAGGTGATGAAGACAGCCC
     T* 454 (SEQ ID NO. 2)
1561 GATCCTCCCCTGGACTCCCGTGTAGGAACCTGCACACGAGCAGACACCCTTGGAGCTCTG
1621 AGTTCCGGCACCAGTAGCGGGCCCGAAAGAGGCACCCTTCCATCTGATTCCAGCACAACC
1681 TTCAAGCTGCTTTTTGTTTTTTGTTTTTTTGAGGTGGAGTCTCGCTCTGTTGCCCAGGCT
1741 GGAGTGCAGTGGCGAAATACCCTGCTCACTGCAGCCTCCGCTTCCCTGGTTCAAGCGATT
1801 CTCTTGCCTCAGCTTCCCCAGTAGCTGGGACCACAGGTGCCCGCCACCACACCCAACTAA
1861 TTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCTGCTCTCAAACCCC
1921 TGACCTCAAATGATGTGCCTGCTTCAGCCTCCCACAGTGCTGGGATTACAGGCATGGGCC
1981 ACCACGCCTAGCCTCACGCTCCTTTCTGATCTTCACTAAGAACAAAAGAAGCAGCAACTT
2041 GCAAGGGCGGCCTTTCCCACTGGTCCATCTGGTTTTCTCTCCAGGGTCTTGCAAAATTCC
2101 TGACGAGATAAGCAGTTATGTGACCTCACGTGCAAAGCCACCAACAGCCACTCAGAAAAG
2161 ACGCACCAGCCCAGAAGTGCAGAACTGCAGTCACTGCACGTTTTCATCTTTAGGGACCAG
2221 AACCAAACCCACCCTTTCTACTTCCAAGACTTATTTTCACATGTGGGGAGGTTAATCTAG
2281 GAATGACTCGTTTAAGGCCTATTTTCATGATTTCTTTGTAGCATTTGGTGCTTGACGTAT
2341 TATTGTCCTTTGATTCCAAATAATATGTTTCCTTCCCTCAAAAAAAAAAAAAAAAAAAAA
2401 AAAAAAAAAAAAA (SEQ ID NO. 1)

Compc8   CEG..FVC AQTGRCVNRR LLCNGDNDCG DQSDEAN.C (SEQ ID NO. 9)
Matr     CPG.QFTC .RTGRCIRKE LRCDGWADCT DHSDELN.C (SEQ ID NO. 10)
Gp300-1  CQQGYFKC QSEGQCIPSS WVCDQDQDCD DGSDERQDC (SEQ ID NO. 11)
Gp300-2  CSSHQITC .SNGQCIPSE YRCDHVRDCP DGADE.NDC (SEQ ID NO. 12)
TADG12   CSGK.YRC RSSFKCIELI TRCDGVSDCK DGEDEYR.C (SEQ ID NO. 13)
Tmprss2  CSNSGIEC DSSGTCINPS NWCDGVSHCP GGEDENR.C (SEQ ID NO. 14)
Cons     C      C      C       C     C     DE   C

BovEntk  VRLVGGSGPH EGRVEI.FHE GQWGTVCDDR WELRGGLVVC RSLGYKGVQS
MacSR    VRLVGGSGPH EGRVEI.LHS GQWGTICDDR WEVRVGQVVC RSLGYPGVQA
TADG12   VRVGG...QN AVLQVFTA.. ASWKTMCSDD WKGHYANVAC AQLGFP.SYV
Tmprss2  VRLYG...PN                       WNENYGRAAC RDMGYKNNFY
HumEntk  VRFFNGTTNN NGLVRFRIQ. SIWHTACAEN WTTQISNDVC QLLGLGSG..
Cons     VR                      W   C    W        C

BovEntk  VHKRAYFGKG TGPIWLNEVF CFGK..ESSI EECRIRQWGV R.ACSHDEDA
MacSR    VHKAAHFGQG TGPIWLNEVF CFGR..ESSI EECKIRQWGT R.ACSHSEDA
TADG12   SSDNLRVSSL EGQFREEFVS I.DHLLPDDK VTALHHSVYV REGCASGHVV
Tmprss2  SSQGIVDDSG STSFMKLNTS A.GNV...DI YKKLYHS... .DACSSKAVV
HumEntk  NSSKPIFSTD GGPFVKLNTA PDGHLILTPS QQ CLQDSLI
Cons                                                    C

BovEntk  GVTCT (SEQ ID NO. 15)
MacSR    GVTCT (SEQ ID NO. 16)
TADG12   TLQCT (SEQ ID NO. 17)
Tmprss2  SLRCL (SEQ ID NO. 18)
HumEntk  RLQC. (SEQ ID NO. 19)
Cons        C

ProM     LWVLTAAHCK KPNL       QVFLGKHNLR QRESSQEQSS VVRAVIHPDY
Tryl     QWVVSAGHCY KSRI       QVRLGEHNIE VLEGNEQFIN AAKIIRHPQY
Kal      QWVLTAAHCF D.GLPLQDVW RIYSGILNLS DITKDTPFSQ IKEIIIHQNY
TADG12   LWIITAAHCV .YDLYLPKSW TIQVGLV..S LLDNPAPSHL VEKIVYHSKY
Tmprss2  EWIVTAAHCV EKPLNNPWHW TAFAGILRQS FMFYGA.GYQ VQKVISHPNY
Heps     DWVLTAAHCF PERNRVLSRW RVFAGAVAQA SPHGLQLG.. VQAVVYHGGY
Cons      W   A HC                  G                    H  Y

ProM     DAAS       HDQDIMLLRL ARPAKLSELI QPLPLERDCS ANT..TSCHI
Tryl     DRKT       LNNDIMLIKL SSRAVINARV STISLPTAPP ATG..TKCLI
Kal      KVSE       GNHDIALIKL QAPLNYTEFQ KPICLPSKGD TSTIYTNCWV
TADG12   KPKR       LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT
Tmprss2  DSKT       KNNDIALMKL QKPLTFNDLV KPVCLPNPGM MLQPEQLCWI
Heps     LPFRDPNSEE NSNDIALVHL SSPLPLTEYI QPVCLPAAGQ ALVDGKICTV
Cons                   DI L  L                L             C

ProM     LGWGKTAD.. GDFPDTIQCA YIHLVSREEC EHA..YPGQI TQNMLCAGDE
Tryl     SGWGNTASSG ADYPDELQCL DAPVLSQAKC EAS..YPGKI TSNMFCVGFL
Kal      TGWGFSKEK. GEIQNILQKV NIPLVTNEEC QKR.YQDYKI TQRMVCAGYK
TADG12   SGWGAT.EDG GDASPVLNHA AVPLISNKIC NHRDVYGGII SPSMLCAGYL
Tmprss2  SGWGAT.EEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI TPAMICAGFL
Heps     TGWGNT.QYY GQQAGVLQEA RVPIISNDVC NGADFYGNQI KPKMFCAGYP
Cons      GWG                           C          I    M C G

ProM     KYGKDSCQGD SGGPLVC (SEQ ID NO. 20)
Tryl     EGGKDSCQGD SGGPVVC (SEQ ID NO. 21)
Kal      EGGKDACKGD SGGPLVC (SEQ ID NO. 22)
TADG12   TGGVDSCQGD SGGPLVC (SEQ ID NO. 23)
Tmprss2  QGNVDSCQGD SGGPLVT (SEQ ID NO. 24)
Heps     EGGIDACQGD SGGPFVC (SEQ ID NO. 25)
Cons         D C GD SGGP V

FIG. 5C




FIG. 6 






SEQUENCE LISTING 

<110> O'Brien, Timothy J. 

Underwood, Lowell J.
<120> Transmembrane Serine Protease Overexpressed 

in Ovarian Carcinoma and Uses Thereof 
<130> D6192PCT 
<141> 2000-03-02 
<150> 09/261,416 
<151> 1999-03-03 
<160> 153 



<210> 1 

<211> 2413 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<223> entire cDNA sequence of TADG-12 gene 

<400> 1 



cgggaaaggg ctgtgtttat gggaagccag taacactgtg gcctactatc 50
tcttccgtgg tgccatctac atttttggga ctcgggaatt atgaggtaga 100
ggtggaggcg gagccggatg tcagaggtcc tgaaatagtc accatggggg 150
aaaatgatcc gcctgctgtt gaagccccct tctcattccg atcgcttttt 200
ggccttgatg atttgaaaat aagtcctgtt gcaccagatg cagatgctgt 250
tgctgcacag atcctgtcac tgctgccatt tgaagttttt tcccaatcat 300
cgtcattggg gatcattgca ttgatattag cactggccat tggtctgggc 350
atccacttcg actgctcagg gaagtacaga tgtcgctcat cctttaagtg 400
tatcgagctg ataactcgat gtgacggagt ctcggattgc aaagacgggg 450
aggacgagta ccgctgtgtc cgggtgggtg gtcagaatgc cgtgctccag 500
gtgttcacag ctgcttcgtg gaagaccatg tgctccgatg actggaaggg 550
tcactacgca aatgttgcct gtgcccaact gggtttccca agctatgtga 600
gttcagataa cctcagagtg agctcgctgg aggggcagtt ccgggaggag 650
tttgtgtcca tcgatcacct cttgccagat gacaaggtga ctgcattaca 700
ccactcagta tatgtgaggg agggatgtgc ctctggccac gtggttacct 750
tgcagtgcac agcctgtggt catagaaggg gctacagctc acgcatcgtg 800
ggtggaaaca tgtccttgct ctcgcagtgg ccctggcagg ccagccttca 850
gttccagggc taccacctgt gcgggggctc tgtcatcacg cccctgtgga 900
tcatcactgc tgcacactgt gtttatgact tgtacctccc caagtcatgg 950
accatccagg tgggtctagt ttccctgttg gacaatccag ccccatccca 1000
cttggtggag aagattgtct accacagcaa gtacaagcca aagaggctgg 1050
gcaatgacat cgcccttatg aagctggccg ggccactcac gttcaatgaa 1100
atgatccagc ctgtgtgcct gcccaactct gaagagaact tccccgatgg 1150
aaaagtgtgc tggacgtcag gatggggggc cacagaggat ggaggtgacg 1200
cctcccctgt cctgaaccac gcggccgtcc ctttgatttc caacaagatc 1250
tgcaaccaca gggacgtgta cggtggcatc atctccccct ccatgctctg 1300
cgcgggctac ctgacgggtg gcgtggacag ctgccagggg gacagcgggg 1350
ggcccctggt gtgtcaagag aggaggctgt ggaagttagt gggagcgacc 1400
agctttggca tcggctgcgc agaggtgaac aagcctgggg tgtacacccg 1450
tgtcacctcc ttcctggact ggatccacga gcagatggag agagacctaa 1500
aaacctgaag aggaagggga caagtagcca cctgagttcc tgaggtgatg 1550
aagacagccc gatcctcccc tggactcccg tgtaggaacc tgcacacgag 1600
cagacaccct tggagctctg agttccggca ccagtagcgg gcccgaaaga 1650
ggcacccttc catctgattc cagcacaacc ttcaagctgc tttttgtttt 1700
ttgttttttt gaggtggagt ctcgctctgt tgcccaggct ggagtgcagt 1750
ggcgaaatac cctgctcact gcagcctccg cttccctggt tcaagcgatt 1800
ctcttgcctc agcttcccca gtagctggga ccacaggtgc ccgccaccac 1850
acccaactaa tttttgtatt tttagtagag acagggtttc accatgttgg 1900
ccaggctgct ctcaaacccc tgacctcaaa tgatgtgcct gcttcagcct 1950
cccacagtgc tgggattaca ggcatgggcc accacgccta gcctcacgct 2000
cctttctgat cttcactaag aacaaaagaa gcagcaactt gcaagggcgg 2050
cctttcccac tggtccatct ggttttctct ccagggtctt gcaaaattcc 2100
tgacgagata agcagttatg tgacctcacg tgcaaagcca ccaacagcca 2150
ctcagaaaag acgcaccagc ccagaagtgc agaactgcag tcactgcacg 2200
ttttcatctt tagggaccag aaccaaaccc accctttcta cttccaagac 2250
ttattttcac atgtggggag gttaatctag gaatgactcg tttaaggcct 2300
attttcatga tttctttgta gcatttggtg cttgacgtat tattgtcctt 2350
tgattccaaa taatatgttt ccttccctca aaaaaaaaaa aaaaaaaaaa 2400
aaaaaaaaaa aaa 2413



<210> 2 

<211> 454 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> complete amino acid sequence of TADG-12 

protein 

<400> 2 



Met Gly Glu Asn Asp Pro Pro Ala Val Glu Ala Pro Phe Ser Phe
                  5                  10                  15
Arg Ser Leu Phe Gly Leu Asp Asp Leu Lys Ile Ser Pro Val Ala
                 20                  25                  30
Pro Asp Ala Asp Ala Val Ala Ala Gln Ile Leu Ser Leu Leu Pro
                 35                  40                  45
Phe Glu Val Phe Ser Gln Ser Ser Ser Leu Gly Ile Ile Ala Leu
                 50                  55                  60
Ile Leu Ala Leu Ala Ile Gly Leu Gly Ile His Phe Asp Cys Ser
                 65                  70                  75
Gly Lys Tyr Arg Cys Arg Ser Ser Phe Lys Cys Ile Glu Leu Ile
                 80                  85                  90
Thr Arg Cys Asp Gly Val Ser Asp Cys Lys Asp Gly Glu Asp Glu
                 95                 100                 105
Tyr Arg Cys Val Arg Val Gly Gly Gln Asn Ala Val Leu Gln Val
                110                 115                 120
Phe Thr Ala Ala Ser Trp Lys Thr Met Cys Ser Asp Asp Trp Lys
                125                 130                 135
Gly His Tyr Ala Asn Val Ala Cys Ala Gln Leu Gly Phe Pro Ser
                140                 145                 150
Tyr Val Ser Ser Asp Asn Leu Arg Val Ser Ser Leu Glu Gly Gln
                155                 160                 165
Phe Arg Glu Glu Phe Val Ser Ile Asp His Leu Leu Pro Asp Asp
                170                 175                 180
Lys Val Thr Ala Leu His His Ser Val Tyr Val Arg Glu Gly Cys
                185                 190                 195
Ala Ser Gly His Val Val Thr Leu Gln Cys Thr Ala Cys Gly His
                200                 205                 210
Arg Arg Gly Tyr Ser Ser Arg Ile Val Gly Gly Asn Met Ser Leu
                215                 220                 225
Leu Ser Gln Trp Pro Trp Gln Ala Ser Leu Gln Phe Gln Gly Tyr
                230                 235                 240
His Leu Cys Gly Gly Ser Val Ile Thr Pro Leu Trp Ile Ile Thr
                245                 250                 255
Ala Ala His Cys Val Tyr Asp Leu Tyr Leu Pro Lys Ser Trp Thr
                260                 265                 270
Ile Gln Val Gly Leu Val Ser Leu Leu Asp Asn Pro Ala Pro Ser
                275                 280                 285
His Leu Val Glu Lys Ile Val Tyr His Ser Lys Tyr Lys Pro Lys
                290                 295                 300
Arg Leu Gly Asn Asp Ile Ala Leu Met Lys Leu Ala Gly Pro Leu
                305                 310                 315
Thr Phe Asn Glu Met Ile Gln Pro Val Cys Leu Pro Asn Ser Glu
                320                 325                 330
Glu Asn Phe Pro Asp Gly Lys Val Cys Trp Thr Ser Gly Trp Gly
                335                 340                 345
Ala Thr Glu Asp Gly Gly Asp Ala Ser Pro Val Leu Asn His Ala
                350                 355                 360
Ala Val Pro Leu Ile Ser Asn Lys Ile Cys Asn His Arg Asp Val
                365                 370                 375
Tyr Gly Gly Ile Ile Ser Pro Ser Met Leu Cys Ala Gly Tyr Leu
                380                 385                 390
Thr Gly Gly Val Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu
                395                 400                 405
Val Cys Gln Glu Arg Arg Leu Trp Lys Leu Val Gly Ala Thr Ser
                410                 415                 420
Phe Gly Ile Gly Cys Ala Glu Val Asn Lys Pro Gly Val Tyr Thr
                425                 430                 435
Arg Val Thr Ser Phe Leu Asp Trp Ile His Glu Gln Met Glu Arg
                440                 445                 450
Asp Leu Lys Thr



<210> 3 

<211> 2544 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<223> entire cDNA sequence of TADG-12 variant gene

<400> 3 



cgggaaaggg ctgtgtttat gggaagccag taacactgtg gcctactatc 50 

tcttccgtgg tgccatctac atttttggga ctcgggaatt atgaggtaga 100 

ggtggaggcg gagccggatg tcagaggtcc tgaaatagtc accatggggg 150 

aaaatgatcc gcctgctgtt gaagccccct tctcattccg atcgcttttt 200 

ggccttgatg atttgaaaat aagtcctgtt gcaccagatg cagatgctgt 250 

tgctgcacag atcctgtcac tgctgccatt tgaagttttt tcccaatcat 300 

cgtcattggg gatcattgca ttgatattag cactggccat tggtctgggc 350

atccacttcg actgctcagg gaagtacaga tgtcgctcat cctttaagtg 400 

tatcgagctg ataactcgat gtgacggagt ctcggattgc aaagacgggg 450 

aggacgagta ccgctgtgtc cgggtgggtg gtcagaatgc cgtgctccag 500 

gtgttcacag ctgcttcgtg gaagaccatg tgctccgatg actggaaggg 550 

tcactacgca aatgttgcct gtgcccaact gggtttccca agctatgtaa 600 

gttcagataa cctcagagtg agctcgctgg aggggcagtt ccgggaggag 650 

tttgtgtcca tcgatcacct cttgccagat gacaaggtga ctgcattaca 700 

ccactcagta tatgtgaggg agggatgtgc ctctggccac gtggttacct 750 

tgcagtgcac agcctgtggt catagaaggg gctacagctc acgcatcgtg 800 



ggtggaaaca tgtccttgct ctcgcagtgg ccctggcagg ccagccttca 850
gttccagggc taccacctgt gcgggggctc tgtcatcacg cccctgtgga 900
tcatcactgc tgcacactgt gtttatgaga ttgtagctcc tagagaaagg 950
gcagacagaa gaggaaggaa gctcctgtgc tggaggaaac ccacaaaaat 1000
gaaaggacct agaccttccc atagctaatt ccagtggacc atgttatggc 1050
agatacaggc ttgtacctcc ccaagtcatg gaccatccag gtgggtctag 1100
tttccctgtt ggacaatcca gccccatccc acttggtgga gaagattgtc 1150
taccacagca agtacaagcc aaagaggctg ggcaatgaca tcgcccttat 1200
gaagctggcc gggccactca cgttcaatga aatgatccag cctgtgtgcc 1250
tgcccaactc tgaagagaac ttccccgatg gaaaagtgtg ctggacgtca 1300
ggatgggggg ccacagagga tggaggtgac gcctcccctg tcctgaacca 1350
cgcggccgtc cctttgattt ccaacaagat ctgcaaccac agggacgtgt 1400
acggtggcat catctccccc tccatgctct gcgcgggcta cctgacgggt 1450
ggcgtggaca gctgccaggg ggacagcggg gggcccctgg tgtgtcaaga 1500
gaggaggctg tggaagttag tgggagcgac cagctttggc atcggctgcg 1550
cagaggtgaa caagcctggg gtgtacaccc gtgtcacctc cttcctggac 1600
tggatccacg agcagatgga gagagaccta aaaacctgaa gaggaagggg 1650
acaagtagcc acctgagttc ctgaggtgat gaagacagcc cgatcctccc 1700
ctggactccc gtgtaggaac ctgcacacga gcagacaccc ttggagctct 1750
gagttccggc accagtagcg ggcccgaaag aggcaccctt ccatctgatt 1800
ccagcacaac cttcaagctg ctttttgttt tttgtttttt tgaggtggag 1850
tctcgctctg ttgcccaggc tggagtgcag tggcgaaata ccctgctcac 1900
tgcagcctcc gcttccctgg ttcaagcgat tctcttgcct cagcttcccc 1950
agtagctggg accacaggtg cccgccacca cacccaacta atttttgtat 2000
ttttagtaga gacagggttt caccatgttg gccaggctgc tctcaaaccc 2050
ctgacctcaa atgatgtgcc tgcttcagcc tcccacagtg ctgggattac 2100
aggcatgggc caccacgcct agcctcacgc tcctttctga tcttcactaa 2150
gaacaaaaga agcagcaact tgcaagggcg gcctttccca ctggtccatc 2200
tggttttctc tccagggtct tgcaaaattc ctgacgagat aagcagttat 2250
gtgacctcac gtgcaaagcc accaacagcc actcagaaaa gacgcaccag 2300
cccagaagtg cagaactgca gtcactgcac gttttcatct ttagggacca 2350
gaaccaaacc caccctttct acttccaaga cttattttca catgtgggga 2400
ggttaatcta ggaatgactc gtttaaggcc tattttcatg atttctttgt 2450
agcatttggt gcttgacgta ttattgtcct ttgattccaa ataatatgtt 2500
tccttccctc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 2544





<210> 4
<211> 294
<212> PRT
<213> Homo sapiens
<220>
<223> complete amino acid sequence of TADG-12 variant protein
<400> 4

Met Gly Glu Asn Asp Pro Pro Ala Val Glu Ala Pro Phe Ser Phe
                  5                  10                  15
Arg Ser Leu Phe Gly Leu Asp Asp Leu Lys Ile Ser Pro Val Ala
                 20                  25                  30
Pro Asp Ala Asp Ala Val Ala Ala Gln Ile Leu Ser Leu Leu Pro
                 35                  40                  45
Phe Glu Val Phe Ser Gln Ser Ser Ser Leu Gly Ile Ile Ala Leu
                 50                  55                  60
Ile Leu Ala Leu Ala Ile Gly Leu Gly Ile His Phe Asp Cys Ser
                 65                  70                  75
Gly Lys Tyr Arg Cys Arg Ser Ser Phe Lys Cys Ile Glu Leu Ile
                 80                  85                  90
Thr Arg Cys Asp Gly Val Ser Asp Cys Lys Asp Gly Glu Asp Glu
                 95                 100                 105
Tyr Arg Cys Val Arg Val Gly Gly Gln Asn Ala Val Leu Gln Val
                110                 115                 120
Phe Thr Ala Ala Ser Trp Lys Thr Met Cys Ser Asp Asp Trp Lys
                125                 130                 135
Gly His Tyr Ala Asn Val Ala Cys Ala Gln Leu Gly Phe Pro Ser
                140                 145                 150
Tyr Val Ser Ser Asp Asn Leu Arg Val Ser Ser Leu Glu Gly Gln
                155                 160                 165
Phe Arg Glu Glu Phe Val Ser Ile Asp His Leu Leu Pro Asp Asp
                170                 175                 180
Lys Val Thr Ala Leu His His Ser Val Tyr Val Arg Glu Gly Cys
                185                 190                 195
Ala Ser Gly His Val Val Thr Leu Gln Cys Thr Ala Cys Gly His
                200                 205                 210
Arg Arg Gly Tyr Ser Ser Arg Ile Val Gly Gly Asn Met Ser Leu
                215                 220                 225
Leu Ser Gln Trp Pro Trp Gln Ala Ser Leu Gln Phe Gln Gly Tyr
                230                 235                 240
His Leu Cys Gly Gly Ser Val Ile Thr Pro Leu Trp Ile Ile Thr
                245                 250                 255
Ala Ala His Cys Val Tyr Glu Ile Val Ala Pro Arg Glu Arg Ala
                260                 265                 270
Asp Arg Arg Gly Arg Lys Leu Leu Cys Trp Arg Lys Pro Thr Lys
                275                 280                 285
Met Lys Gly Pro Arg Pro Ser His Ser
                290

<210> 5 
<211> 174 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> nucleotide sequence of the subclone containing
the 180 bp band from the PCR product for TADG-12
<400> 5

tgggtggtga cggcggcgca ctgtgtttat gacttgtacc tccccaagtc 50
atggaccatc caggtgggtc tagtttccct gttggacaat ccagccccat 100
cccacttggt ggagaagatt gtctaccaca gcaagtacaa gccaaagagg 150
ctgggcaacg acatcgccct ccta 174

<210> 6 
<211> 58 
<212> PRT 

<213> Artificial sequence 

<220> 

<223> deduced amino acid sequence of the 180 bp band
from the PCR product for TADG-12
<400> 6

Trp Val Val Thr Ala Ala His Cys Val Tyr Asp Leu Tyr Leu Pro
                  5                  10                  15
Lys Ser Trp Thr Ile Gln Val Gly Leu Val Ser Leu Leu Asp Asn
                 20                  25                  30
Pro Ala Pro Ser His Leu Val Glu Lys Ile Val Tyr His Ser Lys
                 35                  40                  45
Tyr Lys Pro Lys Arg Leu Gly Asn Asp Ile Ala Leu Leu
                 50                  55
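
As an illustrative cross-check that is not part of the original listing, the deduced peptide of SEQ ID NO. 6 can be reproduced by translating SEQ ID NO. 5 in its first reading frame with the standard genetic code. The following is a minimal sketch only; `CODON_TABLE` and `translate` are hypothetical helper names introduced here, and the nucleotide string is copied from <400> 5 above.

```python
from itertools import product

# Standard genetic code, enumerated in T, C, A, G order (illustrative helper).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, offset: int = 0) -> str:
    """Translate DNA starting at a 0-based offset, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the listing
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# SEQ ID NO. 5 (174 nucleotides), as given in the listing.
SEQ_ID_NO_5 = """
tgggtggtga cggcggcgca ctgtgtttat gacttgtacc tccccaagtc
atggaccatc caggtgggtc tagtttccct gttggacaat ccagccccat
cccacttggt ggagaagatt gtctaccaca gcaagtacaa gccaaagagg
ctgggcaacg acatcgccct ccta
"""

peptide = translate(SEQ_ID_NO_5)
print(len(peptide), peptide)
```

Under these assumptions the 174 nucleotides give 58 residues beginning Trp-Val-Val-Thr-Ala-Ala-His-Cys, in agreement with <211> 58 of SEQ ID NO. 6.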



<210> 7 
<211> 328 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> nucleotide sequence of the subclone containing
the 300 bp band from the PCR product for
TADG-12 variant, which contains an additional
insert of 133 bases

<400> 7

gggtggtgac ggcggcgcac tgtgtttatg agattgtagc tcctagagaa 50
agggcagaca gaagaggaag gaagctcctg tgctggagga aacccacaaa 100
aatgaaagga cctagacctt cccatagcta attccagtgg accatgttat 150
ggcagataca ggcttgtacc tccccaagtc atggaccatc caggtgggtc 200
tagtttccct gttggacaat ccagccccat cccacttggt ggagaagatt 250
gtctaccaca gcaagtacaa gccaaagagg ctgggcaacg acatcgccct 300
cctaatcact agtgcggccg cctgcagg 328



<210> 8 
<211> 42 
<212> PRT 

<213> Artificial sequence 

<220> 

<223> deduced amino acid sequence of the 300 bp band
from the PCR product for TADG-12 variant, which is
a truncated form of TADG-12

<400> 8

Val Val Thr Ala Ala His Cys Val Tyr Glu Ile Val Ala Pro Arg
                  5                  10                  15
Glu Arg Ala Asp Arg Arg Gly Arg Lys Leu Leu Cys Trp Arg Lys
                 20                  25                  30
Pro Thr Lys Met Lys Gly Pro Arg Pro Ser His Ser
                 35                  40
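
The truncation of the variant can be illustrated in the same hedged way. The sketch below, which is not part of the original listing, translates the first 150 nucleotides of SEQ ID NO. 7 starting at its third nucleotide (0-based offset 2) and reports whether a stop codon is reached; `translate_to_stop` is a hypothetical helper name, and the choice of offset is an assumption inferred from the peptide shown in SEQ ID NO. 8.

```python
from itertools import product

# Standard genetic code, enumerated in T, C, A, G order (illustrative helper).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_to_stop(dna: str, offset: int):
    """Return (peptide, stop_found) for a translation starting at a 0-based offset."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            return "".join(residues), True
        residues.append(aa)
    return "".join(residues), False


# First 150 nucleotides of SEQ ID NO. 7, as given in the listing.
SEQ_ID_NO_7_START = """
gggtggtgac ggcggcgcac tgtgtttatg agattgtagc tcctagagaa
agggcagaca gaagaggaag gaagctcctg tgctggagga aacccacaaa
aatgaaagga cctagacctt cccatagcta attccagtgg accatgttat
"""

variant_peptide, stop_found = translate_to_stop(SEQ_ID_NO_7_START, offset=2)
print(len(variant_peptide), stop_found, variant_peptide)
```

With this offset the reading frame runs into a TAA stop codon after 42 residues, ending in Pro-Ser-His-Ser, which matches the length and carboxyl terminus given for SEQ ID NO. 8.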



<210> 9
<211> 34
<212> PRT
<213> Homo sapiens
<220>
<221> DOMAIN
<223> LDLR-A domain of the complement subunit C8 (Compc8)
<400> 9

Cys Glu Gly Phe Val Cys Ala Gln Thr Gly Arg Cys Val Asn Arg
                  5                  10                  15
Arg Leu Leu Cys Asn Gly Asp Asn Asp Cys Gly Asp Gln Ser Asp
                 20                  25                  30
Glu Ala Asn Cys

<210> 10
<211> 34
<212> PRT
<213> Homo sapiens
<220>
<221> DOMAIN
<223> LDLR-A domain of the serine protease matriptase (Matr)
<400> 10

Cys Pro Gly Gln Phe Thr Cys Arg Thr Gly Arg Cys Ile Arg Lys
                  5                  10                  15
Glu Leu Arg Cys Asp Gly Trp Ala Asp Cys Thr Asp His Ser Asp
                 20                  25                  30
Glu Leu Asn Cys

<210> 11
<211> 37
<212> PRT
<213> Homo sapiens
<220>
<221> DOMAIN
<223> LDLR-A domain of the glycoprotein GP300 (Gp300-1)
<400> 11

Cys Gln Gln Gly Tyr Phe Lys Cys Gln Ser Glu Gly Gln Cys Ile
                  5                  10                  15
Pro Ser Ser Trp Val Cys Asp Gln Asp Gln Asp Cys Asp Asp Gly
                 20                  25                  30
Ser Asp Glu Arg Gln Asp Cys
                 35

<210> 12
<211> 35
<212> PRT
<213> Homo sapiens
<220>
<221> DOMAIN
<223> LDLR-A domain of the glycoprotein GP300 (Gp300-2)
<400> 12

Cys Ser Ser His Gln Ile Thr Cys Ser Asn Gly Gln Cys Ile Pro
                  5                  10                  15
Ser Glu Tyr Arg Cys Asp His Val Arg Asp Cys Pro Asp Gly Ala
                 20                  25                  30
Asp Glu Asn Asp Cys
                 35

<210> 13
<211> 35
<212> PRT
<213> Homo sapiens
<220>
<221> DOMAIN
<222> 74...108
<223> LDLR-A domain of TADG-12
<400> 13

Cys Ser Gly Lys Tyr Arg Cys Arg Ser Ser Phe Lys Cys Ile Glu
                  5                  10                  15
Leu Ile Thr Arg Cys Asp Gly Val Ser Asp Cys Lys Asp Gly Glu
                 20                  25                  30
Asp Glu Tyr Arg Cys
                 35

<210> 14
<211> 36
<212> PRT
<213> Homo sapiens
<220>
<221> DOMAIN
<223> LDLR-A domain of the serine protease TMPRSS2 (Tmprss2)
<400> 14

Cys Ser Asn Ser Gly Ile Glu Cys Asp Ser Ser Gly Thr Cys Ile
                  5                  10                  15
Asn Pro Ser Asn Trp Cys Asp Gly Val Ser His Cys Pro Gly Gly
                 20                  25                  30
Glu Asp Glu Asn Arg Cys
                 35



<210> 15 

<211> 101 

<212> PRT 

<213> Bos taurus 

<220> 

<221> DOMAIN 

<223> SRCR domain of bovine enterokinase (BovEntk)

<400> 15

Val Arg Leu Val Gly Gly Ser Gly Pro His Glu Gly Arg Val Glu
                  5                  10                  15
Ile Phe His Glu Gly Gln Trp Gly Thr Val Cys Asp Asp Arg Trp
                 20                  25                  30
Glu Leu Arg Gly Gly Leu Val Val Cys Arg Ser Leu Gly Tyr Lys
                 35                  40                  45
Gly Val Gln Ser Val His Lys Arg Ala Tyr Phe Gly Lys Gly Thr
                 50                  55                  60
Gly Pro Ile Trp Leu Asn Glu Val Phe Cys Phe Gly Lys Glu Ser
                 65                  70                  75
Ser Ile Glu Glu Cys Arg Ile Arg Gln Trp Gly Val Arg Ala Cys
                 80                  85                  90
Ser His Asp Glu Asp Ala Gly Val Thr Cys Thr
                 95                 100

<210> 16 

<211> 101 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 

<223> SRCR domain of human macrophage scavenger 

receptor (MacSR) 

<400> 16 



Val Arg Leu Val Gly Gly Ser Gly Pro His Glu Gly Arg Val Glu
                  5                  10                  15
Ile Leu His Ser Gly Gln Trp Gly Thr Ile Cys Asp Asp Arg Trp
                 20                  25                  30
Glu Val Arg Val Gly Gln Val Val Cys Arg Ser Leu Gly Tyr Pro
                 35                  40                  45
Gly Val Gln Ala Val His Lys Ala Ala His Phe Gly Gln Gly Thr
                 50                  55                  60
Gly Pro Ile Trp Leu Asn Glu Val Phe Cys Phe Gly Arg Glu Ser
                 65                  70                  75
Ser Ile Glu Glu Cys Lys Ile Arg Gln Trp Gly Thr Arg Ala Cys
                 80                  85                  90
Ser His Ser Glu Asp Ala Gly Val Thr Cys Thr
                 95                 100



<210> 17 

<211> 98 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 

<222> 109...206

<223> SRCR domain of TADG-12 (TADG12)

<400> 17

Val Arg Val Gly Gly Gln Asn Ala Val Leu Gln Val Phe Thr Ala
                  5                  10                  15
Ala Ser Trp Lys Thr Met Cys Ser Asp Asp Trp Lys Gly His Tyr
                 20                  25                  30
Ala Asn Val Ala Cys Ala Gln Leu Gly Phe Pro Ser Tyr Val Ser
                 35                  40                  45
Ser Asp Asn Leu Arg Val Ser Ser Leu Glu Gly Gln Phe Arg Glu
                 50                  55                  60
Glu Phe Val Ser Ile Asp His Leu Leu Pro Asp Asp Lys Val Thr
                 65                  70                  75
Ala Leu His His Ser Val Tyr Val Arg Glu Gly Cys Ala Ser Gly
                 80                  85                  90
His Val Val Thr Leu Gln Cys Thr
                 95



<210> 18 

<211> 94 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 






<223> SRCR domain of the serine protease TMPRSS2 

(Tmprss2)
<400> 18 



Val Arg Leu Tyr Gly Pro Asn Phe Ile Leu Gln Met Tyr Ser Ser
                  5                  10                  15

Gln Arg Lys Ser Trp His Pro Val Cys Gln Asp Asp Trp Asn Glu
                 20                  25                  30

Asn Tyr Gly Arg Ala Ala Cys Arg Asp Met Gly Tyr Lys Asn Asn
                 35                  40                  45

Phe Tyr Ser Ser Gln Gly Ile Val Asp Asp Ser Gly Ser Thr Ser
                 50                  55                  60

Phe Met Lys Leu Asn Thr Ser Ala Gly Asn Val Asp Ile Tyr Lys
                 65                  70                  75

Lys Leu Tyr His Ser Asp Ala Cys Ser Ser Lys Ala Val Val Ser
                 80                  85                  90

Leu Arg Cys Leu

























<210> 19 

<211> 90 

<212> PRT 

<213> Homo sapiens

<220> 

<221> DOMAIN 

<223> SRCR domain of human enterokinase (HumEntk)

<400> 19 



Val Arg Phe Phe Asn Gly Thr Thr Asn Asn Asn Gly Leu Val Arg
                  5                  10                  15

Phe Arg Ile Gln Ser Ile Trp His Thr Ala Cys Ala Glu Asn Trp
                 20                  25                  30

Thr Thr Gln Ile Ser Asn Asp Val Cys Gln Leu Leu Gly Leu Gly
                 35                  40                  45

Ser Gly Asn Ser Ser Lys Pro Ile Phe Ser Thr Asp Gly Gly Pro
                 50                  55                  60

Phe Val Lys Leu Asn Thr Ala Pro Asp Gly His Leu Ile Leu Thr
                 65                  70                  75

Pro Ser Gln Gln Cys Leu Gln Asp Ser Leu Ile Arg Leu Gln Cys
                 80                  85                  90



<210> 20 

<211> 149 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 





<223> protease domain of protease M (ProM)

<400> 20






















Leu Trp Val Leu Thr Ala Ala His Cys Lys Lys Pro Asn Leu Gln
                  5                  10                  15

Val Phe Leu Gly Lys His Asn Leu Arg Gln Arg Glu Ser Ser Gln
                 20                  25                  30

Glu Gln Ser Ser Val Val Arg Ala Val Ile His Pro Asp Tyr Asp
                 35                  40                  45

Ala Ala Ser His Asp Gln Asp Ile Met Leu Leu Arg Leu Ala Arg
                 50                  55                  60

Pro Ala Lys Leu Ser Glu Leu Ile Gln Pro Leu Pro Leu Glu Arg
                 65                  70                  75

Asp Cys Ser Ala Asn Thr Thr Ser Cys His Ile Leu Gly Trp Gly
                 80                  85                  90

Lys Thr Ala Asp Gly Asp Phe Pro Asp Thr Ile Gln Cys Ala Tyr
                 95                 100                 105

Ile His Leu Val Ser Arg Glu Glu Cys Glu His Ala Tyr Pro Gly
                110                 115                 120

Gln Ile Thr Gln Asn Met Leu Cys Ala Gly Asp Glu Lys Tyr Gly
                125                 130                 135

Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Val Cys
                140                 145





<210> 21 

<211> 151 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 

<223> protease domain of trypsinogen I (TryI)

<400> 21 



Gln Trp Val Val Ser Ala Gly His Cys Tyr Lys Ser Arg Ile Gln
                  5                  10                  15

Val Arg Leu Gly Glu His Asn Ile Glu Val Leu Glu Gly Asn Glu
                 20                  25                  30

Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro Gln Tyr Asp
                 35                  40                  45

Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Ser Ser
                 50                  55                  60

Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro Thr
                 65                  70                  75

Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly
                 80                  85                  90

Asn Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp Glu Leu Gln Cys
                 95                 100                 105

Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu Ala Ser Tyr
                110                 115                 120

Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe Leu Glu
                125                 130                 135

Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val Val
                140                 145                 150

Cys































<210> 22 

<211> 158 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 

<223> protease domain of plasma kallikrein (Kal) 

<400> 22 

Gln Trp Val Leu Thr Ala Ala His Cys Phe Asp Gly Leu Pro Leu
                  5                  10                  15

Gln Asp Val Trp Arg Ile Tyr Ser Gly Ile Leu Asn Leu Ser Asp
                 20                  25                  30

Ile Thr Lys Asp Thr Pro Phe Ser Gln Ile Lys Glu Ile Ile Ile
                 35                  40                  45

His Gln Asn Tyr Lys Val Ser Glu Gly Asn His Asp Ile Ala Leu
                 50                  55                  60

Ile Lys Leu Gln Ala Pro Leu Asn Tyr Thr Glu Phe Gln Lys Pro
                 65                  70                  75

Ile Cys Leu Pro Ser Lys Gly Asp Thr Ser Thr Ile Tyr Thr Asn
                 80                  85                  90

Cys Trp Val Thr Gly Trp Gly Phe Ser Lys Glu Lys Gly Glu Ile
                 95                 100                 105

Gln Asn Ile Leu Gln Lys Val Asn Ile Pro Leu Val Thr Asn Glu
                110                 115                 120

Glu Cys Gln Lys Arg Tyr Gln Asp Tyr Lys Ile Thr Gln Arg Met
                125                 130                 135

Val Cys Ala Gly Tyr Lys Glu Gly Gly Lys Asp Ala Cys Lys Gly
                140                 145                 150

Asp Ser Gly Gly Pro Leu Val Cys
                155

<210> 23

<211> 157

<212> PRT

<213> Homo sapiens

<220>

<221> DOMAIN

<223> protease domain of TADG-12 (TADG12)

<400> 23

Leu Trp Ile Ile Thr Ala Ala His Cys Val Tyr Asp Leu Tyr Leu
                  5                  10                  15

Pro Lys Ser Trp Thr Ile Gln Val Gly Leu Val Ser Leu Leu Asp
                 20                  25                  30

Asn Pro Ala Pro Ser His Leu Val Glu Lys Ile Val Tyr His Ser
                 35                  40                  45

Lys Tyr Lys Pro Lys Arg Leu Gly Asn Asp Ile Ala Leu Met Lys
                 50                  55                  60

Leu Ala Gly Pro Leu Thr Phe Asn Glu Met Ile Gln Pro Val Cys
                 65                  70                  75

Leu Pro Asn Ser Glu Glu Asn Phe Pro Asp Gly Lys Val Cys Trp
                 80                  85                  90

Thr Ser Gly Trp Gly Ala Thr Glu Asp Gly Gly Asp Ala Ser Pro
                 95                 100                 105

Val Leu Asn His Ala Ala Val Pro Leu Ile Ser Asn Lys Ile Cys
                110                 115                 120

Asn His Arg Asp Val Tyr Gly Gly Ile Ile Ser Pro Ser Met Leu
                125                 130                 135

Cys Ala Gly Tyr Leu Thr Gly Gly Val Asp Ser Cys Gln Gly Asp
                140                 145                 150

Ser Gly Gly Pro Leu Val Cys
                155

<210> 24

<211> 159

<212> PRT

<213> Homo sapiens

<220>

<221> DOMAIN

<223> protease domain of TMPRSS2 (Tmprss2)

<400> 24



Glu Trp Ile Val Thr Ala Ala His Cys Val Glu Lys Pro Leu Asn
                  5                  10                  15

Asn Pro Trp His Trp Thr Ala Phe Ala Gly Ile Leu Arg Gln Ser
                 20                  25                  30

Phe Met Phe Tyr Gly Ala Gly Tyr Gln Val Gln Lys Val Ile Ser
                 35                  40                  45

His Pro Asn Tyr Asp Ser Lys Thr Lys Asn Asn Asp Ile Ala Leu
                 50                  55                  60

Met Lys Leu Gln Lys Pro Leu Thr Phe Asn Asp Leu Val Lys Pro
                 65                  70                  75

Val Cys Leu Pro Asn Pro Gly Met Met Leu Gln Pro Glu Gln Leu
                 80                  85                  90

Cys Trp Ile Ser Gly Trp Gly Ala Thr Glu Glu Lys Gly Lys Thr
                 95                 100                 105

Ser Glu Val Leu Asn Ala Ala Lys Val Leu Leu Ile Glu Thr Gln
                110                 115                 120

Arg Cys Asn Ser Arg Tyr Val Tyr Asp Asn Leu Ile Thr Pro Ala
                125                 130                 135

Met Ile Cys Ala Gly Phe Leu Gln Gly Asn Val Asp Ser Cys Gln
                140                 145                 150

Gly Asp Ser Gly Gly Pro Leu Val Thr
                155















<210> 25 

<211> 164 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> DOMAIN 

<223> protease domain of Hepsin (Heps) 

<400> 25 



Asp Trp Val Leu Thr Ala Ala His Cys Phe Pro Glu Arg Asn Arg
                  5                  10                  15

Val Leu Ser Arg Trp Arg Val Phe Ala Gly Ala Val Ala Gln Ala
                 20                  25                  30

Ser Pro His Gly Leu Gln Leu Gly Val Gln Ala Val Val Tyr His
                 35                  40                  45

Gly Gly Tyr Leu Pro Phe Arg Asp Pro Asn Ser Glu Glu Asn Ser
                 50                  55                  60

Asn Asp Ile Ala Leu Val His Leu Ser Ser Pro Leu Pro Leu Thr
                 65                  70                  75

Glu Tyr Ile Gln Pro Val Cys Leu Pro Ala Ala Gly Gln Ala Leu
                 80                  85                  90

Val Asp Gly Lys Ile Cys Thr Val Thr Gly Trp Gly Asn Thr Gln
                 95                 100                 105

Tyr Tyr Gly Gln Gln Ala Gly Val Leu Gln Glu Ala Arg Val Pro
                110                 115                 120


Ile Ile Ser Asn Asp Val Cys Asn Gly Ala Asp Phe Tyr Gly Asn
                125                 130                 135

Gln Ile Lys Pro Lys Met Phe Cys Ala Gly Tyr Pro Glu Gly Gly
                140                 145                 150

Ile Asp Ala Cys Gln Gly Asp Ser Gly Gly Pro Phe Val Cys
                155                 160

<210> 26

<211> 23

<212> DNA

<213> Artificial sequence

<220>

<221> primer_bind

<222> 6, 9, 12, 15, 18

<223> forward redundant primer for the consensus
sequences of amino acids surrounding the catalytic
triad for serine proteases, n = inosine

<400> 26

tgggtngtna cngcngcnca ytg 23

<210> 27

<211> 20

<212> DNA

<213> Artificial sequence

<220>

<221> primer_bind

<222> 3, 6, 9, 12, 15, 18

<223> reverse redundant primer for the consensus
sequences of amino acids surrounding the catalytic
triad for serine proteases, n = inosine

<400> 27

arnarngcna tntcnttncc 20

<210> 28

<211> 20

<212> DNA

<213> Artificial sequence

<220>

<221> primer_bind

<223> forward oligonucleotide primer for TADG-12
used for quantitative PCR

<400> 28

gaaacatgtc cttgctctcg 20

<210> 29

<211> 20

<212> DNA

<213> Artificial sequence

<220>

<221> primer_bind

<223> reverse oligonucleotide primer for TADG-12
used for quantitative PCR

<400> 29

actaacttcc acagcctcct 20



<210> 30 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> primer_bind 

<223> forward oligonucleotide primer for TADG-12 

variant (TADG-12V) used for quantitative PCR 

<400> 30 



tccaggtggg tctagtttcc 



20 



<210> 31

<211> 20

<212> DNA

<213> Artificial sequence

<220>

<221> primer_bind

<223> reverse oligonucleotide primer for TADG-12
variant (TADG-12V) used for quantitative PCR

<400> 31

ctctttggct tgtacttgct 20



<210> 32 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> primer_bind

<223> forward oligonucleotide primer for β-tubulin

used as an internal control for quantitative PCR 

<400> 32 



cgcatcaacg tgtactacaa 



20 



<210> 33 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> primer_bind 

<223> reverse oligonucleotide primer for β-tubulin

used as an internal control for quantitative PCR 

<400> 33 



tacgagctgg tggactgaga 



20 



<210> 34

<211> 12

<212> PRT

<213> Artificial sequence

<220>

<223> a poly-lysine linked multiple antigen peptide
derived from the TADG-12 carboxy-terminal protein
sequence, present in full length TADG-12, but not
in TADG-12V

<400> 34

Trp Ile His Glu Gln Met Glu Arg Asp Leu Lys Thr
                  5                  10

<210> 35 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 40... 48 

<223> TADG-12 peptide

<400> 35 

Ile Leu Ser Leu Leu Pro Phe Glu Val
5 

<210> 36 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 144. . .152 

<223> TADG-12 peptide 

<400> 36 

Ala Gln Leu Gly Phe Pro Ser Tyr Val
5 

<210> 37 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 225... 233 

<223> TADG-12 peptide 

<400> 37 

Leu Leu Ser Gln Trp Pro Trp Gln Ala
5 

<210> 38 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 252... 260 

<223> TADG-12 peptide 

<400> 38 

Trp Ile Ile Thr Ala Ala His Cys Val
5 






<210> 39 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 356 . . .364 

<223> TADG-12 peptide 

<400> 39 

Val Leu Asn His Ala Ala Val Pro Leu 
5 

<210> 40 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 176... 184 

<223> TADG-12 peptide 

<400> 40 

Leu Leu Pro Asp Asp Lys Val Thr Ala 
5 

<210> 41 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 13...21

<223> TADG-12 peptide 

<400> 41 

Phe Ser Phe Arg Ser Leu Phe Gly Leu 
5 

<210> 42 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 151. . .159 

<223> TADG-12 peptide

<400> 42 

Tyr Val Ser Ser Asp Asn Leu Arg Val 
5 

<210> 43 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 436 . . .444 

<223> TADG-12 peptide 

<400> 43 






Arg Val Thr Ser Phe Leu Asp Trp Ile
5 

<210> 44 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 234. . .242 

<223> TADG-12 peptide 

<400> 44 

Ser Leu Gln Phe Gln Gly Tyr His Leu
5 

<210> 45 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 181. . .189 

<223> TADG-12 peptide 

<400> 45 

Lys Val Thr Ala Leu His His Ser Val 
5 

<210> 46 
<211> 9 
<212> PRT 

<213> Homo sapiens
<220> 

<222> 183 . . .191 

<223> TADG-12 peptide 

<400> 46 

Thr Ala Leu His His Ser Val Tyr Val 
5 

<210> 47 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 411. . .419 

<223> TADG-12 peptide 

<400> 47 

Arg Leu Trp Lys Leu Val Gly Ala Thr 
5 

<210> 48 
<211> 9 
<212> PRT 

<213> Homo sapiens






<220> 

<222> 60. . . 68 

<223> TADG-12 peptide 

<400> 48 

Leu Ile Leu Ala Leu Ala Ile Gly Leu
5 

<210> 49 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 227...235

<223> TADG-12 peptide 

<400> 49 

Ser Gln Trp Pro Trp Gln Ala Ser Leu
5 

<210> 50 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 301... 309 

<223> TADG-12 peptide 

<400> 50 

Arg Leu Gly Asn Asp Ile Ala Leu Met
5 

<210> 51 

<211> 9 

<212> PRT 

<213> Homo sapiens

<220> 

<222> 307 . . .315 

<223> TADG-12 peptide 

<400> 51 

Ala Leu Met Lys Leu Ala Gly Pro Leu 
5 

<210> 52 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 262 . . .270 

<223> TADG-12 peptide 

<400> 52 

Asp Leu Tyr Leu Pro Lys Ser Trp Thr 
5 






<210> 53 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 416...424

<223> TADG-12 peptide 

<400> 53 



Leu Val Gly Ala Thr Ser Phe Gly Ile
5 



<210> 54 

<211> 9 

<212> PRT 

<213> Homo sapiens

<220> 

<222> 54...62

<223> TADG-12 peptide 

<400> 54 



Ser Leu Gly Ile Ile Ala Leu Ile Leu
5 



<210> 55 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 218... 226 

<223> TADG-12 peptide 

<400> 55 



Ile Val Gly Gly Asn Met Ser Leu Leu
5 



<210> 56 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 35 . . .43 

<223> TADG-12 peptide 

<400> 56 



Ala Val Ala Ala Gln Ile Leu Ser Leu
5 



<210> 57 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 271... 279 

<223> TADG-12 peptide 

<400> 57 






Ile Gln Val Gly Leu Val Ser Leu Leu
5 

<210> 58 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 397... 405 

<223> TADG-12 peptide 

<400> 58 

Cys Gln Gly Asp Ser Gly Gly Pro Leu
5 

<210> 59 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220>

<222> 270...278

<223> TADG-12 peptide 

<400> 59 

Thr Ile Gln Val Gly Leu Val Ser Leu
5 

<210> 60 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 56. . .64 

<223> TADG-12 peptide 

<400> 60 

Gly Ile Ile Ala Leu Ile Leu Ala Leu
5 

<210> 61 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 110... 118 

<223> TADG-12 peptide 

<400> 61 

Arg Val Gly Gly Gln Asn Ala Val Leu
5 

<210> 62 

<211> 9 

<212> PRT 

<213> Homo sapiens 



<220>

<222> 217...225

<223> TADG-12 peptide

<400> 62

Arg Ile Val Gly Gly Asn Met Ser Leu
5 



<210> 63 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 130... 138 

<223> TADG-12 peptide 

<400> 63 



Cys Ser Asp Asp Trp Lys Gly His Tyr 
5 



<210> 64 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 8... 16 

<223> TADG-12 peptide 

<400> 64 



Ala Val Glu Ala Pro Phe Ser Phe Arg 
5 



<210> 65 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 328... 336 

<223> TADG-12 peptide 

<400> 65 



Asn Ser Glu Glu Asn Phe Pro Asp Gly 
5 



<210> 66 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 3... 11 

<223> TADG-12 peptide 

<400> 66 



Glu Asn Asp Pro Pro Ala Val Glu Ala 
5 






<210> 67 

<211> 9 

<212> PRT 

<213> Homo sapiens

<220> 

<222> 98 . . . 106 

<223> TADG-12 peptide 

<400> 67 

Asp Cys Lys Asp Gly Glu Asp Glu Tyr 
5 

<210> 68 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 346. . .354 

<223> TADG-12 peptide 

<400> 68 

Ala Thr Glu Asp Gly Gly Asp Ala Ser 
5 

<210> 69 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 360. . .368 

<223> TADG-12 peptide 

<400> 69 

Ala Ala Val Pro Leu Ile Ser Asn Lys
5 

<210> 70 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 153. . .161 

<223> TADG-12 peptide 

<400> 70 

Ser Ser Asp Asn Leu Arg Val Ser Ser 

5 

<210> 71 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 182. . .190 

<223> TADG-12 peptide 

<400> 71 






Val Thr Ala Leu His His Ser Val Tyr 
5 



<210> 72 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 143... 151 

<223> TADG-12 peptide 

<400> 72 



Cys Ala Gln Leu Gly Phe Pro Ser Tyr
5 



<210> 73 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 259...267

<223> TADG-12 peptide 

<400> 73 



Cys Val Tyr Asp Leu Tyr Leu Pro Lys 
5 



<210> 74 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 369. . .377 

<223> TADG-12 peptide 

<400> 74 



Ile Cys Asn His Arg Asp Val Tyr Gly
5 



<210> 75 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 278. . .286 

<223> TADG-12 peptide 

<400> 75 



Leu Leu Asp Asn Pro Ala Pro Ser His 
5 



<210> 76 

<211> 9 

<212> PRT 

<213> Homo sapiens 






<220> 

<222> 426 . . .434 

<223> TADG-12 peptide 

<400> 76 

Cys Ala Glu Val Asn Lys Pro Gly Val 
5 

<210> 77 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 32... 40 

<223> TADG-12 peptide 

<400> 77 

Asp Ala Asp Ala Val Ala Ala Gln Ile
5 

<210> 78 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 406 . . .414 

<223> TADG-12 peptide 

<400> 78 

Val Cys Gln Glu Arg Arg Leu Trp Lys
5 

<210> 79 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 329. . .337 

<223> TADG-12 peptide 

<400> 79 

Ser Glu Glu Asn Phe Pro Asp Gly Lys 
5 

<210> 80 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 303 . . .311 

<223> TADG-12 peptide 

<400> 80 

Gly Asn Asp Ile Ala Leu Met Lys Leu
5 






<210> 81 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 127... 135 

<223> TADG-12 peptide 

<400> 81 

Lys Thr Met Cys Ser Asp Asp Trp Lys 
5 

<210> 82 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 440. . .448 

<223> TADG-12 peptide 

<400> 82 

Phe Leu Asp Trp Ile His Glu Gln Met
5 

<210> 83 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 433... 441 

<223> TADG-12 peptide 

<400> 83 

Val Tyr Thr Arg Val Thr Ser Phe Leu 
5 

<210> 84 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 263... 271 

<223> TADG-12 peptide 

<400> 84 

Leu Tyr Leu Pro Lys Ser Trp Thr Ile
5 

<210> 85 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 169... 177 

<223> TADG-12 peptide 

<400> 85 






Glu Phe Val Ser Ile Asp His Leu Leu
5 



<210> 86 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 296... 304 

<223> TADG-12 peptide 

<400> 86 



Lys Tyr Lys Pro Lys Arg Leu Gly Asn 
5 



<210> 87 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 16... 24 

<223> TADG-12 peptide

<400> 87 



Arg Ser Leu Phe Gly Leu Asp Asp Leu 
5 



<210> 88 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 267...275

<223> TADG-12 peptide

<400> 88

Lys Ser Trp Thr Ile Gln Val Gly Leu
5 



<210> 89 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 81. . .89 

<223> TADG-12 peptide 

<400> 89 



Arg Ser Ser Phe Lys Cys Ile Glu Leu
5 



<210> 90

<211> 9

<212> PRT

<213> Homo sapiens

<220>

<222> 375...383

<223> TADG-12 peptide

<400> 90

Val Tyr Gly Gly Ile Ile Ser Pro Ser
                  5



<210> 91 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 110 . . .118 

<223> TADG-12 peptide 

<400> 91 

Arg Val Gly Gly Gln Asn Ala Val Leu
5 

<210> 92 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 189 . . .197 

<223> TADG-12 peptide 

<400> 92 

Val Tyr Val Arg Glu Gly Cys Ala Ser 
5 

<210> 93 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 165 . . .173 

<223> TADG-12 peptide 

<400> 93 

Gln Phe Arg Glu Glu Phe Val Ser Ile
5 

<210> 94 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 10... 18 

<223> TADG-12 peptide 

<400> 94 

Glu Ala Pro Phe Ser Phe Arg Ser Leu 
5 






<210> 95 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 407... 415 

<223> TADG-12 peptide 

<400> 95 

Cys Gln Glu Arg Arg Leu Trp Lys Leu
5 

<210> 96 

<211> 9 

<212> PRT 

<213> Homo sapiens
<220> 

<222> 381... 389 

<223> TADG-12 peptide 

<400> 96 

Ser Pro Ser Met Leu Cys Ala Gly Tyr 
5 

<210> 97 

<211> 9 

<212> PRT 

<213> Homo sapiens
<220> 

<222> 375 . . .383 

<223> TADG-12 peptide 

<400> 97 

Val Tyr Gly Gly Ile Ile Ser Pro Ser
5 

<210> 98 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 381 . . . 389 

<223> TADG-12 peptide 

<400> 98 

Ser Pro Ser Met Leu Cys Ala Gly Tyr 
5 

<210> 99 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 362 . . .370 

<223> TADG-12 peptide 






<400> 99 

Val Pro Leu Ile Ser Asn Lys Ile Cys
5 

<210> 100 

<211> 9 

<212> PRT 

<213> Homo sapiens

<220> 

<222> 373 . . .381 

<223> TADG-12 peptide 

<400> 100 

Arg Asp Val Tyr Gly Gly Ile Ile Ser
5 

<210> 101 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 283 . . .291 

<223> TADG-12 peptide 

<400> 101 

Ala Pro Ser His Leu Val Glu Lys Ile
5 

<210> 102 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 177 . . .185 

<223> TADG-12 peptide

<400> 102 

Leu Pro Asp Asp Lys Val Thr Ala Leu 
5 

<210> 103 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 47... 55 

<223> TADG-12 peptide 

<400> 103 

Glu Val Phe Ser Gln Ser Ser Ser Leu
5 

<210> 104 
<211> 9 
<212> PRT 






<213> Homo sapiens 
<220> 

<222> 36 . . .44 

<223> TADG-12 peptide 

<400> 104 

Val Ala Ala Gln Ile Leu Ser Leu Leu
5 

<210> 105 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 255... 263 

<223> TADG-12 peptide 

<400> 105 

Thr Ala Ala His Cys Val Tyr Asp Leu 
5 

<210> 106 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 138 . . . 146 

<223> TADG-12 peptide 

<400> 106 

Tyr Ala Asn Val Ala Cys Ala Gln Leu
5 

<210> 107 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 195 . . .203 

<223> TADG-12 peptide 

<400> 107 

Cys Ala Ser Gly His Val Val Thr Leu 
5 

<210> 108 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 215. . .223 

<223> TADG-12 peptide 

<400> 108 

Ser Ser Arg Ile Val Gly Gly Asn Met
5 






<210> 109 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 298. . .306 

<223> TADG-12 peptide 

<400> 109 



Lys Pro Lys Arg Leu Gly Asn Asp Ile
5 



<210> 110 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220>

<222> 313... 321 

<223> TADG-12 peptide 

<400> 110 



Gly Pro Leu Thr Phe Asn Glu Met Ile
5 



<210> 111 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 108. . .116 

<223> TADG-12 peptide 

<400> 111 



Cys Val Arg Val Gly Gly Gln Asn Ala
5 



<210> 112 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 294... 302 

<223> TADG-12 peptide 

<400> 112 



His Ser Lys Tyr Lys Pro Lys Arg Leu 
5 



<210> 113 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 265... 273 

<223> TADG-12 peptide 






<400> 113 

Leu Pro Lys Ser Trp Thr Ile Gln Val
5 

<210> 114 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 88 . . .96 

<223> TADG-12 peptide 

<400> 114 

Glu Leu Ile Thr Arg Cys Asp Gly Val
5 

<210> 115 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 79. . .87 

<223> TADG-12 peptide 

<400> 115 

Arg Cys Arg Ser Ser Phe Lys Cys Ile
5 

<210> 116 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 255 . . .263 

<223> TADG-12 peptide 

<400> 116 

Thr Ala Ala His Cys Val Tyr Asp Leu 
5 

<210> 117 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 207. . .215 

<223> TADG-12 peptide 

<400> 117 

Ala Cys Gly His Arg Arg Gly Tyr Ser 
5 

<210> 118 

<211> 9 

<212> PRT 






<213> Homo sapiens 
<220> 

<222> 154. . .162 

<223> TADG-12 peptide 

<400> 118 

Ser Asp Asn Leu Arg Val Ser Ser Leu 
5 

<210> 119 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 300. . .308 

<223> TADG-12 peptide 

<400> 119 

Lys Arg Leu Gly Asn Asp Ile Ala Leu
5 

<210> 120 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 435. . .443 

<223> TADG-12 peptide 

<400> 120 

Thr Arg Val Thr Ser Phe Leu Asp Trp 
5 

<210> 121 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 376...384

<223> TADG-12 peptide 

<400> 121 

Tyr Gly Gly Ile Ile Ser Pro Ser Met
5 

<210> 122 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 410. . .418 

<223> TADG-12 peptide 

<400> 122 

Arg Arg Leu Trp Lys Leu Val Gly Ala 
5 






<210> 123 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 210. . .218 

<223> TADG-12 peptide 

<400> 123 

His Arg Arg Gly Tyr Ser Ser Arg Ile
5 

<210> 124 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 109. . .117 

<223> TADG-12 peptide 

<400> 124 

Val Arg Val Gly Gly Gln Asn Ala Val
5 

<210> 125 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 191 . . .199 

<223> TADG-12 peptide 

<400> 125 

Val Arg Glu Gly Cys Ala Ser Gly His 
5 

<210> 126 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 78. . .86 

<223> TADG-12 peptide 

<400> 126 

Tyr Arg Cys Arg Ser Ser Phe Lys Cys 
5 

<210> 127 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 113 . . .121 

<223> TADG-12 peptide 






<400> 127 

Gly Gln Asn Ala Val Leu Gln Val Phe
5 

<210> 128 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 91...99

<223> TADG-12 peptide 

<400> 128 

Thr Arg Cys Asp Gly Val Ser Asp Cys 
5 

<210> 129 

<211> 9 

<212> PRT 

<213> Homo sapiens

<220> 

<222> 38 . . .46 

<223> TADG-12 peptide 

<400> 129 

Ala Gln Ile Leu Ser Leu Leu Pro Phe
5 

<210> 130 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 211. . .219 

<223> TADG-12 peptide 

<400> 130 

Arg Arg Gly Tyr Ser Ser Arg Ile Val
5 

<210> 131 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 216 . . .224 

<223> TADG-12 peptide 

<400> 131 

Ser Arg Ile Val Gly Gly Asn Met Ser
5 

<210> 132 
<211> 9 
<212> PRT 






<213> Homo sapiens 
<220> 

<222> 118. . . 126 

<223> TADG-12 peptide 

<400> 132 

Leu Gln Val Phe Thr Ala Ala Ser Trp
5 

<210> 133 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 370... 378 

<223> TADG-12 peptide 

<400> 133 

Cys Asn His Arg Asp Val Tyr Gly Gly 
5 

<210> 134 

<211> 9 

<212> PRT 

<213> Homo sapiens
<220> 

<222> 393... 401 

<223> TADG-12 peptide 

<400> 134 

Gly Val Asp Ser Cys Gln Gly Asp Ser
5 

<210> 135 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 235. . .243 

<223> TADG-12 peptide 

<400> 135 

Leu Gln Phe Gln Gly Tyr His Leu Cys
5 

<210> 136 

<211> 9 

<212> PRT 

<213> Homo sapiens
<220> 

<222> 427... 435 

<223> TADG-12 peptide 

<400> 136 

Ala Glu Val Asn Lys Pro Gly Val Tyr 
5 






<210> 137 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 162...170

<223> TADG-12 peptide 

<400> 137 



Leu Glu Gly Gln Phe Arg Glu Glu Phe 
5 



<210> 138 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 9...17

<223> TADG-12 peptide 

<400> 138 



Val Glu Ala Pro Phe Ser Phe Arg Ser 
5 



<210> 139 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 318...326

<223> TADG-12 peptide 

<400> 139 



Asn Glu Met Ile Gln Pro Val Cys Leu 
5 



<210> 140 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 256...264

<223> TADG-12 peptide 

<400> 140 



Ala Ala His Cys Val Tyr Asp Leu Tyr 
5 



<210> 141 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 46...54

<223> TADG-12 peptide 



<400> 141 

Phe Glu Val Phe Ser Gln Ser Ser Ser 
5 

<210> 142 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 64...72

<223> TADG-12 peptide 

<400> 142 

Leu Ala Ile Gly Leu Gly Ile His Phe 
5 

<210> 143 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 192...200

<223> TADG-12 peptide 

<400> 143 

Arg Glu Gly Cys Ala Ser Gly His Val 
5 

<210> 144 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 330...338

<223> TADG-12 peptide 

<400> 144 

Glu Glu Asn Phe Pro Asp Gly Lys Val 
5 

<210> 145 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 182...190

<223> TADG-12 peptide 

<400> 145 

Val Thr Ala Leu His His Ser Val Tyr 
5 

<210> 146 
<211> 9 
<212> PRT 



<213> Homo sapiens 
<220> 

<222> 408...416

<223> TADG-12 peptide 

<400> 146 

Gln Glu Arg Arg Leu Trp Lys Leu Val 
5 

<210> 147 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 206...214

<223> TADG-12 peptide 

<400> 147 

Thr Ala Cys Gly His Arg Arg Gly Tyr 
5 

<210> 148 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 5...13

<223> TADG-12 peptide 

<400> 148 

Asp Pro Pro Ala Val Glu Ala Pro Phe 
5 

<210> 149 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 261...269

<223> TADG-12 peptide 

<400> 149 

Tyr Asp Leu Tyr Leu Pro Lys Ser Trp 
5 

<210> 150 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 33...41

<223> TADG-12 peptide 

<400> 150 

Ala Asp Ala Val Ala Ala Gln Ile Leu 
5 



<210> 151 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 168...176

<223> TADG-12 peptide 

<400> 151 



Glu Glu Phe Val Ser Ile Asp His Leu 
5 



<210> 152 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 304...312

<223> TADG-12 peptide 

<400> 152 



Asn Asp Ile Ala Leu Met Lys Leu Ala 
5 



<210> 153 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<222> 104...112

<223> TADG-12 peptide 

<400> 153 



Asp Glu Tyr Arg Cys Val Arg Val Gly 
5 